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An Automated 20-20,000-cps 
Transmission Measuring Set 
for Laboratory Use 


By G. D. HAYNIE and P. E. ROSENFELD 
(Manuscript received February 27, 1963) 


An automated loss and phase measuring set has been developed to measure, 
in a point-by-point fashion, two-port networks over the frequency range 
from 20 to 20,000 cps with a maximum accuracy of 0.01 db and 0.1 degree. 
Manual, semiautomatic, and automatic modes of operation are provided. 
During automatic operation, the interval between successive data points is 
determined by programmed intervals of frequency, loss, and phase. These 
quantities are programmed either separately or in combination. Machine 
selection of the measurement points is controlled by comparing the program 
with information fed back from the signal oscillator and the measured cir- 
cult. At each selected point, frequency 1s measured, loss and phase are 
determined by self-balancing standards, and these data are recorded on 
punched tape. 


I, INTRODUCTION 


Automatic measurement techniques have been widely and successfully 
applied in situations where many repetitive measurements of a similar 
type are required. In many cases the measuring instruments are pro- 
grammed for a specific measurement or sequence of measurements, and 
versatility is of minor importance. This paper describes an automatic 
transmission measuring set that was developed for laboratory use in the 
20- to 20,000-cps frequency range to measure two-port networks having 
a wide variety of characteristics. 
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The first section of the paper specifies the quantities that are measured 
and the range and accuracy of the measurements, and lists the functions 
that the skilled operator of a manual measuring set must perform when 
making laboratory measurements. In Section II some of the factors in- 
volved in data taking are discussed, and the chosen method of automatic 
data selection is described. Section III describes, in broad terms, the 
measuring and control circuits used. Operation of the automated set in 
the manual, semiautomatic, and automatic modes is described in Section 
IV. A more detailed description of the measuring and control circuits, 
along with a description of some of the more important subsystems, is 
given in Section V. In Section VI some of the most important systems 
considerations which influenced the measuring set design are discussed. 
Section VII gives the results of tests made to confirm the measuring set 
accuracy and to evaluate, using selected networks, the operation of the 
automatic control and data selection functions of the measuring set. 


1.1 Required Measurements, Accuracy and Ranges 


The basic quantities measured are the insertion loss and phase shift 
of a two-port network as a function of frequency when measured between 
600-ohm unbalanced impedances.! The measurement frequency is con- 
tinuously variable from 20 to 20,000 cps and adjustable to +-0.1 eps. 
The input power level to the measured network is variable in 5-db steps 
from +10 dbm to —20 dbm, each step being accurate to +0.25 db. The 
range of loss measurement is from 0 to 120 db with an accuracy of 0.01 
db for losses less than 40 db. Phase shift is measured from 0 to 360° with 
an accuracy of +0.1° for losses less than 40 db. Measurements can be 
made at impedances other than 600 ohms by changing a pair of plug-in 
terminations, but the oscillator output level will not be calibrated, and 
the loss and phase measurements may not be direct reading. 


1.2 Function of the Measuring Set 


Measurements are required for adjustment of networks and for obtain- 
ing data which can be processed by a computer or recorded in tabular or 
graphical form. In obtaining data with a manually operated set, the 
operator must choose and select the measurement frequencies, make the 
loss and phase measurement, and record the data. Any fully automatic 
set must also carry out these functions and, in particular, must provide 
a way of introducing the necessary inputs into the set so that the re- 
quired data will be obtained. 
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II, SELECTION OF DATA POINTS 


The a priori knowledge of the behavior of the networks being measured 
will not always be sufficient to enable the network designer to preselect 
the frequencies at which the measurements should be made in order that 
no important details be missed. If the frequency interval between suc- 
cessive measurements is made small, all the important details will be de- 
tected, but this would increase the over-all time required to make the 
measurements and give superfluous data. Unneeded data may not be ob- 
jectionable when recorded in graphical form, but stored digital data 
should be kept to a minimum. In order to select only meaningful data on 
a network with uncertain characteristics, some form of feedback is needed 
when selecting the data points. When making manual measurements, a 
skilled operator supplies this feedback. 

To illustrate several criteria by which a machine can select data points, 
an arbitrary loss characteristic is illustrated in Fig. 1(a). If a certain fre- 
quency interval, Af, were to be chosen as the interval between measured 
points, important parts of the characteristic would be missed if a smooth 
curve were drawn through the points taken as shown in Fig. 1(b). Ap- 
plication of this criterion does not require any feedback from the meas- 
ured network. 

If the same characteristic were to be measured by using a selected in- 
crement of loss, AL, to determine the measured points, the characteristic 
would be completely determined within +AZ. This is shown in Fig. 1(c). 
The loss interval criterion does require feedback. It can be seen that the 
effect of the frequency “‘comb”’ (Af criterion) is that fine structure may 
be missed, but coarse structure is reproduced pretty well. The effect of 
the loss ‘“comb”’ is to reproduce fine structure well, but large area errors 
may occur when coarse structure is measured. 

Application of the frequency and loss ‘‘combs” together results in a 
much better reproduction in general than either can do alone. Fig. 1 (d) 
shows a smooth curve drawn through the measured points taken by the 
use of both criteria simultaneously. The same remarks hold for phase 
measurements in which a selected increment of phase, AQ, is used to select 
measured points. 

In the general measurement case, the criteria used to select data points 
in one part of the frequency range may not be the desirable criteria for 
another part of the frequency range. The bandpass filter shown in Fig. 2 
may be used to illustrate this. Typically, one would be interested only in 
the coarse loss structure in the stop bands and what the minimum loss is 
to within perhaps 3 db. Another item of interest might be the loss slope 
in the transition regions or fine variations of the loss and phase in the 
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(a) 
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Fig. 1 — Criteria for data selection. 


passband, because these characteristics influence carrier transmission. 
Thus, the choice of which criteria to use and the size of the chosen inter- 
vals will vary in different frequency bands. 

In the data selection method adopted, a program control is provided 
which permits Af, AL, and Aé intervals to be set up independently in each 
of 5 frequency bands. The Af intervals are 1000, 300 and 100 eps; the 
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Fig. 2— Bandpass filter transmission characteristic. 


AL intervals are 10, 3, 1, 0.3, 0.1 and 0.03 db; and the A@ intervals are 
30, 10, 3, 1 and 0.38 degrees. The edges of the bands can be set to any 
arbitrary frequency in the 20- to 20,000-cps range to the nearest 10 eps. 


III. BLOCK DIAGRAM OF THE TRANSMISSION MEASURING SET 


The block diagram of the measuring set can be conveniently divided 
into two parts, the measuring circuit and the control circuit. 


3.1 Measuring Circuit 


A simplified block diagram of the measuring circuit is shown in Fig. 8. 
The signal oscillator, adjusted to the measurement frequency F, is ap- 
plied to the unknown and to the loss standard. As the sampling switch 81 
rapidly compares the outputs from the two paths, any difference in trans- 
mission between the unknown and the loss standard will result in varia- 
tions of the measurement frequency level and phase at the switching rate. 


2506 THE BELL SYSTEM TECHNICAL JOURNAL, NOVEMBER 1963 


SIGNAL 
OSCILLATOR 







LOSS 
STANDARD 


REFERENCE 
PATH 
CONVERTER 


Fig. 3 — Simplified block diagram — measuring circuit. 


These level and phase variations at frequency F are linearly translated to 
level and phase variations at the fixed frequency A by the measurement 
path converter. The level differences at the converter output are detected 
by the loss detector and indicated by proportional deflections on the loss 
detector meter. When the loss standard is adjusted to have loss equal to 
that of the unknown, the loss detector meter will indicate a null. The loss 
measurement is then completed by recording the setting of the loss stand- 
ard. 

For the phase measurement, a second path is needed to provide a refer- 
ence signal for the phase detector. The output from the measurement 
path converter is applied to the phase standard and the strapped path, 
and switch S2 is switched synchronously with 8, . The phase standard is 
adjusted until the phase at S2 is the same for both switch positions, this 
condition being indicated by a null on the phase detector meter. If the 
phase of the loss standard is negligible, the phase of the unknown will 
then be equal to the phase of the phase standard. The frequency counter 
measures the signal oscillator frequency, and recording the setting of the 
phase standard and frequency counter completes the measurement of a 
point. 


3.2 Control Circuit 


In Fig. 4 the control circuit is added to the measurement circuit. Self- 
balancing controls for the loss and phase standards have been added along 
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with automatic readout. Automation of the set is accomplished by the 
addition of the over-all control and the oscillator sweep control. The over- 
all control consists of selector switches for setting up the measurement 
program; a range control circuit to sense the band in which the oscillator 
frequency, F, is located; loss and phase detectors to sense the magnitude 
of the loss and phase changes; and a frequency control circuit used in 
frequency interval programming. Before discussing the circuit operation 
in more detail, the three operating modes of the measuring set will be 
described. 


IV. OPERATION OF THE MEASURING SET 


The three modes of operation for the set are manual, semiautomatic, 
and automatic. Regardless of which mode is selected, the operator must 
connect the network to be measured, adjust the terminal impedances if 
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Fig. 4 — Simplified block diagram — measuring and control circuits. 
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they are to be different from 600 ohms, and select the oscillator output 
level. I'ig. 5 shows the console arrangement of the set. 


4.1 Manual Operation 


In the manual operating mode, the oscillator can be manually adjusted 
to --5 cps by means of the coarse tuning control used in conjunction with 
a film scale. The fine tuning control, when used with the frequency 
counter, makes it possible to set frequency to +0.1 cps. The frequency 
counter is operated by a pair of pushbutton controls to give either a 
1-second or 10-second time base. 

Relay switching of the IF filters makes two detector bandwidths avail- 
able in the manual mode, one bandwidth being 10 times the other. The 
narrower bandwidth is normally used when the measurement frequency 
is below 500 eps or at other frequencies when the maximum available 
signal-to-noise ratio at the detector is desired. 

Manual adjustment of a set of decade switches is required to balance 
the loss standard. The loss unbalance indication is provided in analog 
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Fig. 5 — Console arrangement of transmission measuring set. 
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form by a calibrated meter with 3 scales and in digital form by indicat- 
ing lamps placed just above the decade switches. These lamps indicate 
which decade switch should be adjusted to obtain a balance. The manual 
phase balance is obtained in the same way by means of similar indica- 
tions. 

After the frequency has been measured and the loss and phase stand- 
ards balanced, the measured values appear on the visual readout panel. 
Data can either be recorded manually or punched on paper tape by oper- 
ating the print out pushbutton. 

The oscillator can be swept in the manual mode by operating the coarse 
tuning control. As the frequency is changed, indications of the loss and 
phase variations in the measured network can be obtained from the ana- 
log and digital indicators of loss and phase unbalances. 


4.2 Semiautomatic Operation 


Operation of the set in the semiautomatic mode permits the operator 
to initiate, by means of pushbutton controls, any step in the sequence 
which forms the automatic operating cycle. The controls for the separate 
steps are labeled ‘balance loss,” “balance phase,” “read frequency,” 
“display data,” “print out,” and “sweep frequency.” Front panel lamps 
are placed above each control to indicate operation of the set in each one 
of these steps. Operation of the balance loss control results in automatic 
switching of the loss standard pads in an ordered sequence to achieve 
balance. Automatic balance of the phase standard is initiated by opera- 
tion of the balance phase control. The loss and phase balances are made 
to a precision which depends either on the absolute loss or on the setting 
of a manual precision control. 

The read frequency control has the same function as in the manual 
mode: that is, to initiate a frequency measurement with either a 1-sec- 
ond or 10-second time base. 

The function of the display data step is to store on the visual readout 
panel the readings of the loss standard, phase standard, and frequency 
counter. 

As in the manual mode, operation of the print out control causes the 
data to be recorded on paper tape. 

The choice of the frequency for the next data point may be made by 
the operator or by the program control. In the latter case, the range con- 
trol, Af, AL, and Aé@ switches are set for the desired program, and opera- 
tion of the sweep frequency control causes the oscillator to sweep until 
one of the programmed intervals is exceeded. 
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It remains only to note that the sequence just described is not rigid, 
and the various steps can, with a few exceptions, be interchanged in se- 
quence or omitted. or example, if the phase shift of a network is not of 
interest, the balance phase step can be omitted. 


4.3 Automatic Operation 


In the automatic mode, the operator must manually set the oscillator 
to the minimum frequency of interest and set up the desired measure- 
ment program. The precision of frequency measurement must be set to 
0.1 cps, 1 eps, or programmed. In the programmed position, the precision 
is 0.1 eps at measurement frequencies below 500 eps and 1 cps above 500 
cps. After the start control is operated, the set will cycle continuously 
until all desired data is taken. Each cycle consists of 5 steps: balance loss, 
balance phase, read frequency, print out, and sweep frequency. Each 
cycle is timed, and in the event that a cycle is not completed in a pre- 
scribed time, an alarm signal occurs. This alarm also signals the end of 
a run, 


V. DESCRIPTION OF SUBSYSTEMS IN THE MEASURING SET 


Fig. 6 is a partially simplified block diagram of the measuring set, in- 
cluding the control circuits. Functioning of the over-all system will be 
more readily understood if some of the subsystems are considered sepa- 
rately, so the oscillator, loss measuring circuit, phase measuring circuit, 
programming circuit, and some of the other important circuits will be 
separately described. 


5.1 Oscillator 


A detailed block diagram of the oscillator is shown in Fig. 7. The oscil- 
lator is of the heterodyne type, containing one crystal oscillator fixed 
at 97 ke and one LC oscillator which is variable from 97.02 to 117 ke. 
The oscillator modulator converts these two frequencies to the measure- 
ment frequency, which can be varied from 0.02 to 20 ke in a single range. 
The stability of the oscillator output frequency is 0.1 eps over the period 
of a measurement. Two AGC loops are employed in the oscillator to ob- 
tain an output flat with frequency to +0.1 db. The level select pads which 
follow the power amplifier are used to adjust the level applied to the cir- 
cuit being measured. 

Tuning of the variable oscillator is controlled by three capacitors. 
Coarse tuning is accomplished with a servo-driven, variable air capacitor 
geared to a 200-inch film scale. The servo can be operated manually with 
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Fig. 6 — Simplified block diagram — measuring and control circuits. 





the coarse tuning control or automatically with signals from the program 
control. A small variable air capacitor mounted on the front panel pro- 
vides the manual fine tuning control, and the third capacitor is voltage 
controlled for operation in an AFC loop. 

When the AFC relay is operated, the voltage-controlled capacitor is 
connected to the LC oscillator. The control voltage for the variable 
capacitor is obtained from a phase detector which has as its inputs the 
measurement signal and a precise 100-cps frequency. If the open-loop 
measurement frequency is within +10 eps of a 100-cps multiple, energiz- 
ing the AFC relay will cause the measurement frequency to lock to this 
100-cps multiple with no frequency error. 

A lamp and a photodiode are used with the film scale to detect 100-cps 
multiples as the measurement frequency is being swept. A decimal 
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Fig. 7 — Oscillator block diagram. 


counter is used to provide indications for frequency intervals of 100, 300, 
and 1000 eps. 

A cam, mounted on the coarse tuning capacitor shaft, operates a micro- 
switch to select the narrow-band IF filter during automatic operation at 
measurement frequencies below 500 eps. 
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5.2 Loss Measuring Circuit 


Tig. 8 shows the loss measuring circuit together with the self-balancing 
circuit. The test signal from the oscillator, after being attenuated to the 
desired level by the output level selector, is applied to the loss standard 
and the unknown. The loss standard consists of relay-switched attenu- 
ator pads, which can be inserted in 0.01-db steps from 0 to 119.99 db. 

The outputs from the standard (S path) and unknown (X path) 
paths are alternately sampled by a mercury relay operating at either 10 
cps or 1.6 cps, depending on whether the wideband or narrow-band IF 
filter is used. The sampling relay is followed by a high-impedance buffer 
amplifier, so that the input levels and terminating impedances for the 
standard and unknown paths are essentially constant. This permits the 
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Fig. 8 — Simplified block diagram — loss measuring circuit. 
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sampling rate to be chosen independently of the characteristics of the 
unknown network. 

The buffer amplifier is followed by a level control circuit, ganged to 
the oscillator output level selector in such a way that the S path input 
level into the loss detector is kept constant +0.1 db as the oscillator out- 
put level is varied. This is also the raison d’étre for the gain control cir- 
cuits which are ganged to the loss standard. The gain control circuits 
provide an S path input level at the loss detector that is constant to 
0.1 db over the full loss measuring range of the set. Maintenance of 
this constant Jevel makes possible a loss detector sensitivity constant to 
+1 per cent. The gain control circuits also provide a constant S path in- 
put level into the measurement path converter for losses up to 80 db. 
This eases the linearity and crosstalk requirements on the converter. 

The measurement path converter linearly translates the amplitude of 
the 20- to 20,000-cps measurement signal to the 97-ke II’. A beam de- 
flection tube is used as the converter. The tube was used because of the 
high degree of balance which can be maintained for the local oscillator 
signal in the modulator output. The voltage gain to this input is less than 
—76 db. This balance eases the IF filter requirements for measurements 
at low frequencies, since at the lowest measurement frequency of 20 eps, 
the local oscillator frequency is within 20 cps and the upper sideband is 
within 40 eps of the desired lower sideband. The narrow-band erystal 
filter (28-cycle noise bandwidth) is used for automatic measurements at 
frequencies below 500 cps. In the manual and semiautomatic modes at 
measurement frequencies above 500 cps and in cases where a maximum 
signal-to-noise ratio is not required, the wideband filter (280-cycle noise 
bandwidth) is used. It was the settling time of these filters that deter- 
mined the maximum switching rates of the sampling relay. 

The loss detector compares the amplitudes of the IF signal for the § 
and X positions of the sampling relay. The difference in amplitudes is 
proportional to the difference in loss between the loss standard and the 
unknown. This loss difference, or unbalance, is displayed on a calibrated 
panel meter with three full-scale ranges, 0.1, 1, and 10 db. A meter relay 
is connected in series with the panel meter and is used to give a discrete 
indication to the master control when the loss unbalance is less than 
0.015 db. 

A discrete loss detector is connected to the loss detector in parallel 
with the meter circuits to provide additional discrete indications of the 
loss unbalance. The discrete loss detector is an eight-level analog-digital 
converter. The eight levels correspond to loss unbalance magnitudes less 
than 0.01 db or greater than 0.01 db, 0.03 db, 0.1 db, 0.3 db, 1 db, 3 db, 
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and 10 db, together with the sign. The indications are used for balancing 
the loss standard and for loss programming. 

The loss standard control operates the relays of the loss standard in 
accordance with the settings of the front panel control knobs in the 
manual mode, and in accordance with the state of a 4-stage reversible 
decimal counter in the semiautomatic and automatic modes. 

When an automatic loss balance is being made, balancing pulses gener- 
ated by the master clock are gated into the loss standard control by the 
master control. The loss standard control then gates the pulses into the 
counter stage controlling the most significant decade requiring a change. 
Control for this gating is furnished by the discrete loss detector, and the 
result is a rapid balancing sequence. 

Since the detector signal-to-noise ratio decreases as the measured loss 
increases, automatic control of the precision of balance is required during 
automatic measurements. The precision of automatic balance versus 
measured loss is given in Table I. For cases where precision less than that 
provided by the automatic control is necessary or desirable, a manual 
precision control is provided. 

When a balance has been achieved, as indicated by signals from the dis- 
crete loss detector and the loss detector meter relay, balancing pulses are 
no longer gated to the loss standard control. If the balance achieved con- 
dition is maintained for a period of 4 successive balancing pulses, the 
loss balance is considered complete and, in the automatic operation mode, 
the phase balancing step is begun. 


5.8 Phase Measuring Circuit 


The phase measuring circuit is shown in Fig. 9. Since the phase detector 
indication is affected by the input levels, the loss balance must precede 
the phase measurement. 

The sampling relay in the comparison unit samples the phase difference 
in the S and X paths. This phase difference is translated by the measure- 


TABLE I 

Measured Loss Precision of Automatic Balance 
0-39 db 0.01 db 
40-59 db 0.03 db 
60-79 db 0.1 db 
80-89 db 0.3 db 
90-99 db 1 db 
100-120 db 3 db 
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Fig. 9 — Simplified block diagram — phase measuring circuit. 


ment path converter to its negative at II’. The phase standard is switched 
in synchronism with the sampling relay in such a way that the phase 
variation at the measuring path input to the phase detector is equal to the 
phase difference between the unknown and the phase standard. This 
phase is compared to the constant phase of the reference path by the 
phase detector, and the phase difference between the unknown and the 
phase standard is displayed on a calibrated panel meter. The meter has 
three full-scale ranges, 1°, 10°, and 80°. A meter relay is also used here to 
give a discrete indication when the phase unbalance is less than 0.15°. 
The deflection sensitivity of the phase detector is maintained constant 
within -+-1 per cent by controlling the input levels to the phase detector 
and by using a voltage-controlled phase shifter in a feedback loop, so 
that the output of the phase detector will be zero when the sampling re- 
lays are in the 8 position. 
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The phase standard is a digital standard which can be relay-switched 
to vary phase from 0 to 360° in 0.1° steps. The phase standard is balanced 
in essentially the same way as the loss standard. The precision of the 
phase balance, which depends on the signal-to-noise ratio, is determined 
either by the insertion loss of the network being measured or by the set- 
ting of the manual precision control. 


5.4 Program Control Circuit 


As shown in Fig. 10, the program control receives inputs from the dis- 
crete loss and phase detectors, the frequency counter, and the oscillator 
Af counter. When the oscillator is being swept during semiautomatic or 
automatic operation, the frequency counter is cycled continuously, using 
a 0.1-second time base. The swept frequency is measured with a maxi- 
mum error of 22 eps, and the output from the frequency counter is com- 
pared to the settings of the range selector switches. This comparison in- 
dicates which of the selected ranges the measurement frequency lies in, 
and also determines which set of Af, AL, and Aé@ switches control the 
program. The settings of the appropriate Af, AL, and Aé@ switches are 
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Fig. 10 — Simplified block diagram — programming circuits. 
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compared to the outputs of the Af counter, the discrete loss detector, and 
the discrete phase detector. Coincidence circuits indicate a match when 
it occurs and stop the oscillator sweep. If the oscillator is stopped because 
of a Af interval being equalled, a signal is sent to the oscillator to engage 
the AFC loop and reset the Af counter. 

The speed of the oscillator sweep is controlled by the program control. 
This speed is proportional to the programmed AZ or Aé@ interval, which- 
ever requires the slower speed. When the indicated loss or phase unbal- 
ance reaches 4 of the programmed interval, the servo speed is reduced. 
These two controls of the oscillator sweep speed make possible a reason- 
ably short sweep time without, at the same time, causing large errors in 
the programmed intervals due to overshoot. Section 6.2.3 shows the 
considerations involved in the choice of sweep speeds. 


5.5 Other Important Circutts 


In addition to the circuits already discussed, there are several others 
which deserve a brief description. 


5.5.1 Frequency Counter 


The frequency counter can be controlled to measure frequency with 3 
time bases. When used with the 1-second and 10-second time bases, the 
counter takes a single measurement, and the reading is used for the visual 
and punched tape readouts. When the 0.1-second time base is used, the 
counter takes measurements continuously, and the readings are used to 
provide inputs to the range control circuit. 


5.5.2 Readout Control 


The front panel of the readout control contains the in-line readout in- 
dicators to display the measured frequency, loss, and phase and 6 selector 
switches to provide an identification number for the punched tape read- 
out. The readout control unit also contains the required translators for 
the visual and punched tape readouts and the parallel-to-serial converter 
for the tape readout. A circuit is also provided to interrupt the recorded 
readout in the event of a failure in the data parity check. 


5.5.3 Master Clock 


All timing pulses necessary for the system are generated by the master 
clock. The clock has two rates, 10 cps and 1.6 eps, for use with the wide- 
and narrow-band II filters, respectively. Pulses from the clock go to the 
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sampling relays, loss detector, discrete loss detector, loss standard con- 
trol, phase detector, discrete phase detector, and phase standard control. 


5.5.4 Master Control 


Sequence control is the best description of the function of the master 
control unit. In the automatic mode, it monitors the operation of the 
balance loss, balance phase, read frequency, print out, and sweep fre- 
quency steps, and controls the order in which they are operated. The 
master control panel contains pushbutton controls to operate each of 
these steps in the semiautomatic mode. An error control is provided 
which allows the operator using the manual or semiautomatic mode to 
designate the most recently punched data point as an error. Finally, the 
panel contains the all-important control which the sorcerer’s apprentice 
could not find, the one marked “stop.” 


VI. SYSTEMS CONSIDERATIONS WHICH INFLUENCED TEST SET DESIGN 


The most important objectives in the design of the test set were to 
achieve the specified measurement ranges and accuracies and to provide 
the means for rapid and accurate data selection based on settings of the 
program control. Some of the factors which were considered in the test 
set design will now be discussed. 


6.1 Measurement Errors 


The following sources of error were considered in the test set design: 
(a) loss standard imperfections 

(b) phase standard imperfections 

(c) insertion ratio errors due to misterminations 

(d) harmonies and modulation products 

(e) IF bandwidth and switching rate of comparison unit 

(f) random noise 

(g) power frequency pickup 

(h) crosstalk and 

(i) drift. 


6.1.1 Loss Standard 


It was required that all steps from 0 to 40 db be accurate to +0.002 
db and that the phase shift of the loss standard be independent of loss 
to within +-0.02° for frequencies from 20 cps to 20 ke. To meet the re- 
quirements on attenuation accuracy, stable wire-wound resistors with the 
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required accuracy were used. To meet the phase requirements, it was 
necessary that the design impedance of the loss standard be made less 
than 600 ohms. For convenience in measuring the loss and phase of the 
standard with existing equipment, the design impedance was made 75 
ohms. After compensation, variations of loss setting produced no ob- 
servable phase changes on a phase detector with 0.01° sensitivity. 


6.1.2 Phase Standard 


All steps of the phase standard were required to be accurate to +0.02°, 
with the output level variations between any two steps less than 1 db. 
The unit consists of 3 stages of relay-switched RC elements. Low react- 
ance, wire-wound resistors and high-Q temperature-compensated silver 
mica capacitors are used to achieve the necessary component stability of 
0.01 per cent and the low output level variation. The phase standard was 
calibrated on the X75706 TMS (Ref. 1, p. 1515), which has the required 
phase accuracy of 0.02° at 97 ke. The phase standard is also frequency- 
sensitive, so a requirement of +0.01 per cent, or about +10 cps, was 
placed in the IF frequency stability. 


6.1.3 Errors Due to Misterminations 


For the comparison circuit shown in Fig. 11, the maximum error, ¢, 
in measuring insertion loss and phase between nonideal terminations is 
approximately,” 


o= [Sue p+ Sal | 4 IGE) + swie’ | + seb" een 


where ¢ is in nepers and radians. The quantities G, L, Su, and See are 
the reflection coefficients of the generator termination, load termination, 
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Fig. 11 — Comparison circuit. 
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and the physical input and output scattering coefficients of the network 
under test, all taken with respect to 600 ohms. The quantities G’, L’, 
Si’, and S22’ are similarly defined for the loss standard path in the com- 
parison circuit and are taken with respect to 75 ohms. 

G, L, G’, L’, Su’, and S22’ are under the designer’s control. The re- 
sistor elements used in the generator and load terminations are within 
0.05 per cent of nominal, and in general the scattering parameters Sy’ 
and S22’ of the loss standard are much smaller than Sy and Sx. Thus, 
for an unknown with scattering coefficients as large as 0.2, the maximum 
error will be 2 X 0.2 X 0.00025 = 10~*. This corresponds to an error of 
0.0009 db or 0.006°. 


6.1.4 Harmonics and Modulation Products 


One group of products of special concern are those which produce 
spurious signals of frequency / at the input to the IF converter and of 
frequency A at the output of the IF converter. These products are: har- 
monies of / present in the oscillator output, A or / + A present in the 
oscillator output, and F or A present in the /’ + A signal. The require- 
ment on the maximum magnitude of these products depends on the 
linearity of the gain and level control amplifiers and the I modulator 
linearity and gain. Requirements were placed so that all spurious sources 
of A at the IF modulator output are 65 db below the IF signal for losses 
of 50 db or less and 20 db below the IF signal at 120 db loss. 

Other products of special concern are the F + A and 2F + A products 
in the IF converter output as F approaches 20 eps. The IF converter is 
balanced against F + A and maintains a A/(F + A) ratio of 30 db or 
greater in the converter output. No balance is obtained against 2F + A, 
so the narrow-band IF filter must provide 35 db discrimination at F + A 
and 65 db discrimination at 2F + A. This very steep cutoff requirement 
on the IF filter imposes the limitation on the operating speed of the 
measuring set. 


6.1.5 IF Bandwidth and Switching Rate of Comparison Unit 


The filter that provides the necessary narrow IF bandwidth will also 
have a large envelope delay and long settling times. The total filter tran- 
sient time determines the maximum switching rate of the comparison unit: 
and the maximum rate at which the loss and phase standards can be 
balanced. Since one of the objectives in automating the set was to oper- 
ate at high speed, a wide filter bandwidth is desirable. Asa compromise, 
two IF filters are used. During automatic operation from 20 cps to 500 
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eps, a filter with a 3-db bandwidth of 18 eps is used and from 500 eps to 
20 ke, the 3-db bandwidth is 180 eps. The filters have a maximally flat 
delay characteristic which gives a relatively short transient time.? The 
two switching rates used are 10 cps and 1.6 eps. 


6.1.6 Random Noise 


The presence of random noise in the measuring circuit will cause both 
fixed and fluctuating errors in the indications of the loss and phase de- 
tectors. The fixed errors cannot be detected and thus produce errors in 
the measurements. The fluctuating errors can be detected and eliminated, 
providing they are averaged for a sufficiently long period. No averaging 
is done in the discrete detectors used in automatic balancing of the loss 
and phase standards, so the precision of automatic balance is reduced as 
the noise level increases. The precision of automatic balance versus level 
is given in Section 5.2. 

The noise bandwidth of the system is determined by the bandwidth 
of the IF filters. The noise bandwidth of the narrow filter is 28 cycles and 
that of the wide filter is 280 cycles. For losses greater than 20 db, the low- 
level point of the system is at the comparison switch (8, , Fig. 6). The 
noise figures of the 0-db buffer amplifier and the level and gain control 
amplifiers which follow 8; determine the system noise figure. The system 
noise figure varies with frequency, being 28 db at frequencies above | ke 
and increasing to 45 db at 20 eps. 

For 40-db loss measurements and 0-dbm sending level, the minimum 
signal is —40 dbm. At frequencies greater than 1 ke, and for a 280-cycle 
noise bandwidth, the minimum signal-to-noise ratio is 81 db, which is 
greater than the required for a measurement with 0.01 db precision. 
For 120 db loss and a 28-cycle noise bandwidth, the minimum signal-to- 
noise ratio is 11 db. This allows 3-db precision with the discrete detector. 


6.1.7 Power Frequency Pickup 


Harmonics of the power frequency which are coupled into the measure- 
ment circuits also cause fixed and fluctuating errors in the loss and phase 
detector indications. These errors occur whenever the measurement fre- 
quency is close enough to one of these harmonic frequencies that the IF 
filter does not discriminate against it. In the automatic and semiauto- 
matic modes, a blanking circuit is provided to prevent the AD and Aé@ 
program controls from stopping the oscillator within +15 eps of 60, 180, 
or 300 eps. Inside these bands, the outputs from the discrete detectors 
are not reliable for programming and balancing. With the use of the 
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blanking, measurements may be made over the 20- to 20,000-cps range 
with a precision of 0.01 db for losses up to 40 db. 


6.1.8 Crosstalk 


Since the signal level into both modulators is kept constant for losses 
up to 80 db, crosstalk at IF or through the modulators was not a difficult 
problem. 

The main source of crosstalk was in power supply coupling. Decoupling 
is especially difficult in those units operating at 20 cps, since effective 
reactive components become very large at that frequency. Some de- 
coupling was effected by using separate power supplies, and in other 
cases an active filter was used to provide decoupling and attenuation of 
power supply ripple. 


6.1.9 Drift 


The 1.6-cps switching rate requires that drifts in the common path 
circuits be less than 0.01 db and 0.1° during the 0.6-second period. 


6.2 Programming Errors 


As discussed in Section II, the measurement programming permits the 
division of the measurement frequency range into as many as 5 bands, 
and in each of these bands a selection of Af, AL, and Aé intervals may be 
made. A natural question to ask is, how accurate are these intervals when 
point-by-point measurements are made? 


6.2.1 Range Control Error 


As the input to the frequency counter is varied at a rate of K cps/sec- 
ond over the counting period 7 seconds, the maximum error, /, in the 
counter reading will be, 


For 7 = 0.1 second and with the maximum sweep rate of 240 eps/sec- 
ond, an error of 22 eps can result. This error in counter reading can cause, 
in the worst case, a range control error of 33 eps. 


6.2.2 Af Program Accuracy 


The low-inertia servo motor selected for the oscillator sweep drive and 
the relatively high-speed Af recognition circuitry make it possible to stop 
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at any Af frequency interval with an overshoot of less than 10 eps. When 
the oscillator drive is stopped, the oscillator frequency is phase locked to 
a multiple of a 100-cps frequency standard, thus giving exact Af intervals. 


6.2.3 AL and A@ Program Interval Accuracy 


Measurements of the AL and A@ program intervals are made with a 
frequency source which is being swept, on unknowns with finite band- 
widths, using a narrow-band detector. In most cases, the delay introduced 
by the detector will be long compared to the delay present in the meas- 
ured networks. The inertia of the oscillator drive motor and the detector 
delay will cause the oscillator to overshoot the frequency where the AL 
or A@ interval is exactly reached. Referring to Fig. 12, it is seen that the 
program error I’, , for loss is 


te 
Ey, = n loge > 
L 2 fi 
where n = loss slope in db/octave 
fe = frequency where oscillation is stopped 


fi = frequency where AL is reached. 
Yor a delay of 7 in recognizing AL and a sweep speed of K cps/second 


E, = n loge (1 es ie) = 144 a 


for small Z, and fi arbitrary. If the fractional program error P is de- 
fined as 





_ Ly 
ays 


then the acceptable rate of sweep for loss intervals is 


_ il fAL 
ae 

The desirable fractional program error and 7 are considered constants, 
and AL depends on the program input. It can be shown that by using the 
differentiated output of the loss detector fed back to control the oscillator 
sweep speed, K, a constant fractional program error can be achieved. 
Similar results can be obtained for phase programming. 

The approach used in this set was to sweep the oscillator at a rate pro- 
portional to AL over the loss range from 0 to AL/3 in such a way that a 
program error of just less than 60 per cent will occur. When AL/3 is 
recognized, the sweep rate is reduced to about 3 of the original rate to 
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Fig. 12 — AL program error. 


limit the overshoot to something less than 10 per cent. The sweep for 
phase intervals is controlled in the same way, and in cases where both 
AL and Aé are programmed, the slower sweep speed called for is used. 
The discrete detectors which measure the AZ and A@é intervals are ac- 
curate to +2 per cent. 


6.3 Operating Speed 


The maximum speed of operation of the set is obtained in the auto- 
matic mode. When the set takes a data point, it goes through a cycle 
consisting of sweep frequency, balance loss, balance phase, read fre- 
quency, and print out. 

As discussed in Section 6.2, the sweep rate is limited by the allowable 
program error. The sweep period could be minimized if a more complex 
feedback arrangement were used along with an oscillator drive motor 
with a 1000:1 speed range. The motor which was used has a speed range 
of 15:1 and the maximum sweep speed is 240 cps/second. 

In the balance loss operation, the IF filter forms part of the control 
loop. The long envelope delay and settling times associated with the 
narrow-band filter constitute almost all of the loop delay, with the re- 
mainder being the conversion time of the discrete detector and the delay 
of the balancing pulse gates. The balancing rate is 10 pps with the wide- 
band filter and 1.6 pps with the narrow-band filter. Even though the IF 
filter is not in the balance phase control loop, for practical reasons the 
balancing rate and logic are similar to those used in the loss standard. A 
timeout of 4 pulses is used after each of the balances before the balance is 
considered achieved. 
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In the read frequency operation, either a 1-second or 10-second time 
base is used for the frequency counter. 

The speed of the readout is limited by the maximum speed of the re- 
cording tape punch and the stepping switch used in the parallel-to-serial 
data conversion. Without contact protection, the stepping switch is 
about 3:1 faster than the tape punch. Since 40 characters are punched at 
each measurement point and the maximum speed of the punch is 20 
characters per second, the lower bound for readout is 2 seconds. The ac- 
tual readout time, with contact protection on the stepping switch, is 
just under 3 seconds. Section 7.4 gives the breakdown of time in each 
step of a typical cycle. 


VII. RESULTS WITH THE AUTOMATED SET 


Tests were made to determine the measurement range and accuracy 
of the set. Selected networks were used to determine the accuracy of 
programmed intervals and the operating speeds, and to evaluate to some 
extent the efficiency of data selection. 


7.1 Measurement Accuracy 


Verification of the loss and phase accuracies of the set was carried out 
by a process of measuring a number of calibrated loss pads and delay 
networks individually and in combination. These measurements showed 
that the objectives of 0.01 db and 0.1 degree accuracy were met for losses 
up to 40 db. A nominal 120-db loss was measured as 114 db. 


7.2 Program Accuracy 


Measurements were made using selected networks to determine the 
accuracy of the Af, AZ, and Aé intervals. 


7.2.1 Af Program Accuracy 


When a programmed Af limit is reached, the oscillator is then phase 
locked to a 100-cps frequency standard so that the frequency intervals 
are, for all practical purposes, exact. Some jitter and 100-cps sidebands 
do appear at the oscillator output, but the resulting variations in the in- 
stantaneous oscillator frequency are less than 0.003 per cent or 0.03 
cps, whichever is larger. 


7.2.2 AL Program Accuracy 


Each of the AL intervals was programmed on a network with the 
maximum loss slope which can be measured to 0.01 db (0.12 db/eps). 
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The error in the programmed intervals is 10 per cent or 0.05 db, which- 
ever 1s larger. 


7.2.3 A@ Program Accuracy 


Each of the A@ intervals was programmed on the maximum phase 
slope which can be measured to 0.1° (1.44°/eps). The error in the inter- 
vals is 15 per cent or 0.8°, whichever is larger. From these results, it can 
be seen that the minimum frequency change that will occur in the pres- 
ence of steep loss or phase slopes is about 0.5 eps. 

For A@ intervals programmed on a 25-usec delay line (slope 0.01°/eps), 
the program error is 2 per cent or 0.1°, whichever is larger. 


7.3 Data Selection 


7.3.1 Delay Equalizer 


A simple example of automatic data selection is provided by measure- 
ments on a delay equalizer. Loss and phase are measured from 500 to 
3400 eps, and loss must be known to the nearest 0.1 db. The program used 
is AL = 0.1 db and Af = 100 eps. Delay is calculated on a computer 
from the phase measurements, and the AZ program characterizes the 
unknown within the required 0.1 db. The Af program eliminates any sig- 
nificant uncertainty in the measurement frequencies and thus removes 
one source of error in the delay measurement. The data obtained are 
plotted in Fig. 13. 


7.3.2 Bandpass Filter 


As a second example of automatic data selection, a program was set 
up to a test a telegraph tone channel filter with the requirements listed 
in Table IT. 

Assuming that the band of interest is from 1 to 5 ke and that the phase 
linearity in the pass band is also of interest, the program was set up as 
shown in Table III. 

The data taken are plotted on Fig. 14. In ranges 1 and 5, all of the 
points are taken by the Af = 300 program, but one has the assurance 
that the loss did not deviate by more than 10 db between any two ad- 
jacent points. The AL = 3 db program is used in ranges 2 and 4 where the 
16-db and 7.5-db loss requirements are placed. In range 3 the phase 
change of about 220° is approximately linear with respect to frequency, 
so the A9 = 30° program provides 7 points spaced at approximately uni- 
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Fig. 13 — Automatic data selection — delay equalizer. 
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Fig. 14 — Automatic data selection — bandpass filter. 
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form frequency intervals. The resulting data is similar to what one would 
expect if it were taken by a capable human operator. 


7.4 Operating Speed 


The program described in Section 7.3.2 required 6.5 minutes to run, 
with an average of 14 seconds required for each point. This average time 
per point can be broken down as shown in Table IV. 

In general, except for the read-out step and the miscellaneous delays, 
the time required for each step varies according to the switching rate 
used, the size of the programmed intervals, the precision of the measure- 
ments, and the unknown being measured. However, from the data taken 
so far, 14 seconds appears to be a good average. 

Manual measurements were made on the filter described in Section 
7.3.2 at approximately the same points plotted on Tig. 14, and the time 
required for the manual measurements was more than 5 times that re- 
quired for the automatic measurements. Experience indicates that the 
average speed of the automated set is at least 10 times that of manual 
sets. 


7.5 Conclusions 


An automated laboratory measuring set has been developed which 
makes loss and phase measurements based on a preselected program and 
on information fed back from the measured network. The program pro- 
vides the means for accurate and relevant data selection from simple in- 
structions based on network requirements. Automatic read-out eliminates 
operator read-out errors, and machine processing of the paper tape out- 
put provides flexibility in the form of output data. The possibility of us- 
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ing the set in the semiautomatic and manual modes provides additional 
flexibility. 

The higher speed of the automated set makes it possible to obtain 
more comprehensive data on networks being measured, with the at- 
tendant benefit to both designers and users, and this data is obtained in 
less time at lower cost. 
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Signal Distortion in Nonlinear 


Feedback Systems 


By I. W. SANDBERG 
(Manuscript received May 6, 1963) 


This paper reports on some properties of the solutions to the functional 
equation s(t) = g[Cse(t) + s,(t)], where ¢ is a nonlinear function, the 
operator C is a convolution, and s, 1s a known function belonging to a 
prescribed Banach space. 

The equation plays a central role in the theory of signal transmission 
through a general physical system containing linear time-invariant ele- 
ments and a single time-variable nonlinear element. In particular we es- 
tablish conditions under which s2(t) is the fixed point of a contraction 
mapping of the Banach space into itself and we discuss some consequences 
of this result. 

As a direct application, we consider the range of validity of two simple 
cascade flow graphs (i.e., flow graphs without feedback loops) for approxt- 
mately determining the signal distortion in nonlinear feedback systems 
when the distortion ts small. Our discussion is not restricted to specific 
types of nonlinear characteristics. 


I. INTRODUCTION 


This paper reports on some properties of the solutions to the functional 
equation s(t) = g[Cs.(t) + s,(t)], where ¢ is a nonlinear function, the 
operator C is a convolution, and s, is a known function belonging to a 
prescribed Banach space. 

The equation plays a central role in the theory of signal transmission 
through a general physical system containing linear time-invariant 
elements and a single time-variable nonlinear element. In particular we 
establish conditions under which s.(t) is the fixed point of a contraction 
mapping of the Banach space into itself and we discuss some conse- 
quences of this result. 

As a direct application, we consider the range of validity of a simple 
cascade flow graph (i.e., a flow graph without feedback loops) for 
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approximately determining the signal distortion in nonlinear feedback 
systems when the distortion is small. Our discussion is not restricted to 
specific types of nonlinear characteristics. 

IXycept in cases in which the nonlinearity is very small, our results 
establish the utility of the graph only when (in a certain precise sense) 
the feedback around the nonlinear element is small. However, our re- 
sults show that the range of validity of this flow graph is very much 
greater than that indicated by an earlier writer’ who has considered this 
question for the case in which the nonlinear characteristic is of the form 
x + ex”, where e and mare real constants with m an odd positive integer. 

As is well known, large amounts of feedback are often present in 
physical systems. In fact, large amounts of feedback are often used to 
reduce nonlinear distortion. Consequently the established range of 
validity of the flow graph mentioned above does not include the most 
important cases of interest. To deal with such situations we propose an 
alternative, but very closely related, flow graph for approximately de- 
termining the signal distortion when the distortion is small. It appears 
that the range of validity of this graph includes the vast majority of 
cases of engineering interest. 

Section II considers some mathematical preliminaries. In Section ITT 
we describe a model of the physical system under consideration and 
show the relevance of the functional equation mentioned above. Section 
IV presents some preliminary results which are concerned with the 
properties of certain linear operators. In the remaining sections we 
consider both some properties of the solutions to the functional equation 
and engineering implications of the results. 


II, MATHEMATICAL PRELIMINARIES 


Let ® = [0,p] be an arbitrary metric space.” A mapping A of the space 
& into itself is said to be a contraction if there exists a number k < 1 
such that 


p(Ax,Ay) S< kp(x,y) 


for any two elements z,y ¢ ©. The contraction-mapping fixed-point 
theorem’ is basic to much of the subsequent discussion. It states that 
every contraction mapping defined in a complete metric space ® has 
one and only one fixed point (i.e., there exists a unique element z ce © 
such that Az = z). Furthermore z = lim A”2», where 2p is an arbitrary 


element of 0. 
Throughout the discussion £2 denotes the space of complex-valued 
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square-integrable functions defined on the real interval (— ©, ). The 
norm of f(t) € £2 is denoted by || f ||2 and is defined by 


writ = fi [so Pa 


The symbol £1 denotes the space of absolutely integrable functions de- 
fined on (— ©, ). We shall use the symbols Lor and Lr, respectively, 
to denote the intersections of the spaces £2 and £1 with the set of real- 
valued functions. It is well known that £2 and £22 are Banach spaces. 
We take as the definition of the Fourier transform of f(t) ¢ £2. U &; 


F(w) [ ¥G).e at fe Ly 


F(w) 


Lim. [ f() edt, fe (£2 — £1) 
and consequently when f(t) & Lz 
$0) a [ F(w) ef! de. 
2a — 0 
With this definition, the Plancherel identity reads 
ae [ s(dg(t) dt = | F(w)G(w) dw, fg e £2. 


Throughout the discussion K(Z) denotes the space of bounded real- 
valued functions that (7) are defined on the real interval (— ©, ) and 
(72) are continuous on (—«,«0) — 2, where > is an arbitrary fixed 
finite or infinite set of isolated points.* The norm of f € K(2Z) is denoted 
by || f |]... and is defined by 


If Ilo = sup fC) [. 


With this norm (2) is a Banach space.{ The norm of a linear operator 
Q defined on K(Z) is denoted by || Q ||.. and similarly for the norm of a 
linear operator defined on £2. 
We shall say that a real-valued function f(t) belongs to D if and only 
if there exists a function f(t) that agrees with f(t) almost everywhere 
* Various signals of interest in communication systems such as pulses are not 
contained in (2) if = is the null set. 
Any Cauchy sequence of functions belonging to K(2) converges to a bounded 
continuous function on (— »,«) — Z (since the sequence converges uniformly on 


(— 2,0) — 3). Since, in addition, the sequence converges at each point of dis- 
continuity, K(Z) is complete. 
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and is such that the set of points at which* sign [f(t)] is discontinuous 
is a set of isolated points. 

The symbols I and O are used theoueaout to denote, respectively, the 
identity operator and the null operator (i.e., for all f, Of = 0). 


III. MATHEMATICAL MODEL OF THE PHYSICAL SYSTEM 


Consider a physical system containing linear time-invariant elements 
and a single time-variable nonlinear element. Let s: and se , respectively, 
denote the system’s input and output signals and let v and w, respec- 
tively, denote the input and output signals associated with the non- 
linear device, which is assumed to be characterized by the equation 


w = o(v,t) = gr] (1) 


in which ¢(v,t) is a real-valued function of the real variables v and ¢. 
We shall consider separately two cases: 
(7) s1 ,S2 ,v,w € K(Z) for some Z 
(17) 81 ,S2 ,V,W E Lor. 
It is assumed in each case that there exist well-defined linear operators 
Tr and A such thatft v = IF[s; ,w] and sp = Als, ,w]. It is convenient to 
define four linear operators A, B, C, and D in the following manner 


v = T7[s1,v] = Ts; 0] + F0,w] 


(2) 
= Asi -+ Cw 

8s = Als; uv] _ Als; 0] fi A[0,w] (3) 
= Ds; ++ Bw. 


The relation between s; , se , v, and w is summarized by the flow graph in 
Fig. 1. 

As a very simple illustration of the generality of the graph in Fig. 1, 
observe that the flow graph of the classical single loop feedback system 
in Fig. 2(a) in which E and F are linear operators can readily be reduced 
to the form shown in Fig. 1. The reduced graph is given in Fig. 2(b). 

Our concern is with the influence of the nonlinear element represented 
by ¢. Hence it is sufficient to consider the situation in Fig. 1 in which 
D = O and A = B =I. The corresponding graph is shown in Fig. 3. 
For this graph, using (1), (2), and (3) 


Se = g[Cse + 53]. (4) 


* Let sign (f@)] = 1 when f (@) = 
t This is essentially the same re adek used by the writer in another study.® 


SIGNAL DISTORTION 2537 





Fig. 1— Flow-graph representation of a general transmission system contain- 
ing linear elements and a single time-variable nonlinear element ¢. 


3.1 The Time-Variable Nonlinear Element and Definition of Signal Dis- 
tortion 


It is assumed throughout that ¢[f(t)] is measurable whenever f is 
measurable, that 9(0,¢) = 0 for all ¢, and that for all ¢ and-all 1 = 


a(v — 2) S v(t) — (vet) S Bui — v2) 


where a and £ are real constants. 


E | FE 
MJ NS et Mo 
Sy F ~ Sp St ~ Se 


Fig. 2— Two flow graphs with identical transmission from s; to se. 


In our application of the theorems in Sections V and VII, we shall 
suppose that ¢(v,t) = v + ¢(v,t) where, for all ¢, d(v,t) is of order less 
than v as v — 0. That is,* we shall suppose that for sufficiently small 


Cc 
Pe eee cs 


Fig. 3 — Basic flow graph for studying the influence of ¢. 


input signals the element represented by ¢ behaves essentially as a unit 
gain amplifier and hence that, for such signals, the system in Fig. 1 
behaves essentially as a linear time-invariant system. Let so. denote the 


* Note that in the frequently encountered case in which ¢ is independent of ¢ 
and ¢’ is a monotone decreasing function of v, 8 = 1. 
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output signal se in Fig. 1 when ¢(v,t) = v. We shall say that (s. — so2) 
is the signal distortion introduced by the departure of ¢(v,t) from ». 


3.2 The Operator C 


Unless stated otherwise, it is assumed that 


of = [elt — 2) slr) ar (5) 


where c(¢) is a real-valued function of ¢. In cases of engineering interest 
C is a causal (i.e., c(t) = 0, ¢ < 0). However our mathematical results 
are not restricted to cases in which C is causal. 


IV. PRELIMINARY RESULTS 


This section is concerned with a proof of 
Theorem I: (a) Let c(t) © &z and Cw) ¥ 1. Then (I — C) tsa bounded 
mapping of K(Z) into itself that possesses a bounded inverse. In fact, 
there exists a function h(t) © Liz, with Fourier transform CWw)[1 — 
C(w)]", such that 


(I- cy'g =o + [ae a)g(r) ar 
for any g © K(2). If h(t) € 9, 
|= Oy te = 1+ [Rw Jae 
(b) Suppose alternatively* that c(t) © Ler , ess sup |C(w) | < &, and 
that inf [1 — C(w) | > 0. Then (I — C) is a ended mapping of Lor 


tnto atself that possesses a bounded inverse. Moreover 


| (I — C)™ Je = ess sup Jl — Cw) [7 


| C |le ess sup | C(w) |. 


4.1 Proof of Part (a) 
Since c(t) e Lip , the validity of the assertion that (I — C) isa bounded 


* The notation ess sup Q(w) denotes inf sup Q@w) where XU is an arbitrary zero- 
w M w£9t 
measure subset of the real line. In at least almost all cases of engineering interest, 


the “‘essential supremum”’ of the modulus of a Fourier transform is equal to its 
supremum. 
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mapping of K(Z) into itself is obvious. For the remainder of part (a) 
we need 

Lemma I: Let c(t) € £irn and C(w) # 1. Then there exists a function h(t) 
e Lin, with Fourier transform C(w) [1 — C(w)]', such that 


n(t) — ot) = [h(t = rele) dr 
almost everywhere. 


4.1.1 Proof of Lemma I: 
It is known** that if c(t) ¢ £12 and inf | 1 — C(w) | > 0, there exists 


an f(t) of bounded total variation on (— ©, ) such that 


ice) 


i - cw) = fal. 


Under these conditions, it follows that 


ni) = [lt — ») afte) 


is an element of £12 which possesses the required Fourier transform. 
However, since C(w) is uniformly continuous and C(w) ~ 0asw— o, 
the inequality inf | 1 — C(w) | > 0 is satisfied if and only if C(w) # 1. 


Thus the assumptions in Lemma I imply the existence of a function 
h(t) © Liz with the stated transform.t Since 


C(w) C(w) 


l=C&) a eae Cla) 


C(w) 
the Iourier transforms of [A(t) — c(t)] and 


i h(t — r)e(r) dr 


are equal. This establishes the equation stated in the lemma. 
Let g(t) denote any element of (2) and assume that there exists 
an f(t) ¢ K(Z) such that 


* The writer is indebted to V. E. Bene for directing attention to the result in 
Ref. 4. 

j A moment’s reflection will show that when C(w) is rational in w, a proof of this 
result follows directly from the identification of the terms in its partial-fraction 
expansion. 
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gt) =-OF=K)-[ olt— ale) dr 6) 


It is certainly true that (6) implies 


[ne = vale) ar 
. ‘ : (7) 
Z i i= D4@in= 3 haa) | [oc ~ u)f(u) au | ar. 


Since f is bounded and h,c € £:z , Fubini’s theorem implies that the last 
integral can be written as 


iB flu) i h(t — r)e(r — u) ar | du 


and hence, in accordance with the lemma, as 


[ na =n sr) a — [ott — Fr) a. (8) 

Therefore, 
[ne = nal) ar = [elt =) He) ar = f(t) ~ gd). 
Thus, if there exists an f e K(Z) such that (I — C)f = g, 
f(t) = g(t) + [alt= r)g(r) ate. (9) 

However, direct substitution and an application of Fubini’s theorem 
show that the right-hand side of (9) is a solution of (6). Hence it is 
the solution. 


Since || (I — C)™ ||. = sup{ ||fll.:(I — C)f = g3 fg © K(2); 
ll g Ilo = 1}, it is evident from (9) that 


Ha@- ope sit f [aw fae 
We shall next show that if h(t) ¢ D 
}@-cytoz1+ flac) lar~s, 
where 6 is an arbitrary positive number and hence that 


|| I - Cc)“ |. = 1 + [_ [al lat 
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Choose a real number fy) and consider 


gto) + [lt — r)g(+) dr. (10) 


We may assume that the set of points at which sign [h(t)] is discontinuous 
is a set of isolated points. Let = denote the union of the closed intervals 
of length 6, centered at t, and at each of the discontinuities of sign [A(t) — 
r)] regarded as a function of 7. Let =, denote the complement of 4 
with respect to the real line. I‘or any 6 > 0, choose 6, such that* 


is ee ae Sis 
Choose g(t) € K() such that g(t.) = 1, || g ||. = 1, and 


g(r) = sign [A(ty) — 7)], TER, 
Then 


gts) + [Wl — role) de = 1+ [jm ae 
a [ h(to — r){g(r) — sign [h(to — 7)]} dr, 
and 


g(t) + flo — r)g(r) dr] = 1+ [nw asa 





This completes the proof of the first part of Theorem I. 
Of course similar arguments show that if c(¢) © D, 


[Cle = fo ote) | at 


4.2 Proof of Part (b): 


The proof of this part involves essentially the same arguments pre- 
sented elsewhere.’ 
Let f ¢ Lox . Then, using Plancherel’s identity, 


* The ania of i (ae — r) | over = does not exceed the sum of the integrals 
over |7 |= Tandz — [—T,7], where [—7',7'], denotes the complement of [—7',7’'] 
with respect to the real line. The first integral can be made arbitrarily small by 
choosing 7 sufficiently large and with fixed 7 the second integral can be made 
arbitrarily small by choosing 6; sufficiently small. 
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| CF lls" 


Lf 2 
5 ic | C(w) Fw) P de 


ess sup | C(w) |? |] f le”. 


IA 


Thus C, and hence (I — C), are bounded. 
Now consider the equation 


(I — C)f =; ge Lor, lg lle = 1 
Since inf |1 — C(w)| > 0, there exists a unique solution f © Lor and 
F(w) = G(w){1 — C(w)]. Again using Plancherel’s identity 


Isl = 1 oy t= 2 [11 - 6) (7160) Fae 


< ess sup |1 — CC) [* Ig Ie? 


(11) 


Clearly, ||(I — C)™ |, S ess sup | 1 — C(w) |. 


According to the definition of the essential supremum of a function, 
for any 6 > 0 there exists a set of values of w of nonzero measure such 
that 


{1 — C(w) |" > ess sup | 1 — C(w) |" — 6. 
Since | G(w) | is permitted to vanish only on the complement of such 
a set, it is evident from (11) that 

| (I - C)™ |p 2 ess sup | 1 — C(w) [* — 4 


for any 6 > 0. Thus, in view of the upper bound on || (I — C)™ |2, 
|| (I — C)™* |lp = ess sup | 1 — C(w) |7. 


A similar argument shows that |] C |l2 = ess sup | C(w) |. This com- 


pletes the proof of Theorem I. 


V. PROPERTIES OF SOLUTIONS TO 82 = g[Cs2 + 8] 


Theorem IIT: Let c(t) € Lr, C(w) ¥ 1, and let gfv] = v + gv] be as 
defined in Section 3.1. Let gf] be continuous with respect to t on the com- 
plement of = whenever f © K(Z). Suppose that 


[| (L— Cy |Jeo- I] C lo max (J1 — a@|,|6-1|])=r<1. 


Then for any 8s, € K(Z), there exists a unique s. © K(Z) such that sz = 
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g[Cs. + 8]. Furthermore sz = lim son where 


N?O 


Sm = (I — C)"G[Coxmay + 1] + (I - C)'s: 


and Se 1s an arbitrary element of K(Z). The nth approximation so, satis- 
Jies 


nr 


| sen ~ 82 [lo S — ll «1 — 820 [lo 





Before proceeding to the proof of Theorem II we state the analogous 
result for the space Loe . 
Theorem IIT: Let c(t) ¢ Lar, sup |C(w) | < ©, inf|1 — C(w) | > 0, 


and let g[v] = v + ¢[v] be as defined in Section 3.1 Suppose that 
| C— C)* fp- C hh max (Il — al, [B-1per<h 
Then the conclusion of Theorem II follows with K(Z) replaced with 
Loz and with the K(Z) norm replaced with the Lor norm. 
5.1 Proof of Theorem Il 
We have 
So = Cs. + 5, + g[Cse + 8] 


and hence, in view of the first part of Theorem I and the assumption 
that C(w) ¥ 1, s. = Lsy where 


Lsg = (I — C)“g[Cse + s] + (I — C)s,. 


It is evident that L is a mapping of K(~2) into itself. We shall show that 
under the conditions stated in the theorem L is in fact a contraction 
mapping of K(2) into itself. Let f, g © K(Z) and observe that 


| Lf — Lg |]. = || @ — C)*falCf + si] — o[Cy + si} |]. 
< || I - C)* Jn |] Cf + si] ~ A Cg + 51] II. 
and that 


|| Cf + sil — glCy + 1] lo. 


glCf + si] — ¢lCg + si] 
( Cf — Cy -1)er— a) |, 


max (|1 — a|, | 8 — 1]) |] Cf — g) lle 
max ([1 — a|,|8—1]) |] Clo] — g lle. 








IIA 


IIA 
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Thus L is a contraction when r < 1. This proves Theorem JI with the 
exception of the last inequality* which follows directly from the fact 
that s. can be written as 


So = Seq + 2, [so¢5-41) — $2,] (12) 
j= 


in which for all7 2 1 


Il Seci41) — S23 [leo = |] Lsey — Lsecz-y lo S 7 || se3 — Segry lle. 


5.2 Proof of Theorem IIT: 
With obvious modifications the proof of Theorem II suffices. 
VI. A CASCADE GRAPH FOR APPROXIMATELY DETERMINING THE SIGNAL 
S2 IN FIG. 3 


Suppose that the input signal s, in Fig. 3 is an element of K(2). 
Then under the assumptions stated in Theorem II, the output signal s. 
is an element of K(Z) and is given by (12) where 899 is an arbitrary 
element of K(2). The key inequality that must be satisfied ist (using 
the first part of Theorem I and assuming that c, h e D) 


[tet jar(a + [imo |ar) es 
-max ({1 —a@|,{@—1]) =r <1. 


If we take s. = (I — C)~'s,, the sum ), [saj41) — 82] represents the 
j=0 


nonlinear distortion present in the output. The first term in this series is 
(so — 80) = (I — C)“a[(I — C)“si] 


and, using the inequality in Theorem II, a bound on the error incurred 
in ignoring the remainder of the series is given by 


i] 


(sox = Soo) = De [Socj41) —= S2;] 
j=0 











- = || sar — 82 [loo 


r 
1l-—r 





IIA 


| Se1 — S20 \|0 5 


* Note that when s29 = 0, this inequality implies that 
I 82 Ilo = (1 —_ 0 as I S21 ll. . 


} In physical systems both integrands vanish for negative arguments. 
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That is, the error in ignoring the remainder of the series is at most 
r(1 — r)” times the norm of the first term. Thus if r is sufficiently 
small the function (sx — seo) is a good approximation to the distortion 
component of the signal s.. Fig. 4 shows the corresponding flow graph 
for determining s , the approximation to the output signal in Fig. 3. 
Theorem III leads to analogous results and the same flow graph for the 
case in which signals belong to £22. The essential difference is that in 
the Lor case attention is focused on the energies of the signals. 


6.1 Relation of the Graph in Fig. 4 to a Well-Known Engineering Technique 


The flow graph in Fig. 4 characterizes the essence of a well-known 
engineering technique”’’’ for approximately determining the effect of 
feedback on nonlinear distortion introduced in one stage of an amplifier, 
when the distortion is ‘“‘small.”’ In particular, observe that if, as indicated 
in Fig. 4, u(t) denotes the distortion* produced by the open-loop system 


I 


(I-e)" 
Sy ? uo (I-C)" Sar 


Fig. 4 — Cascade flow graph for approximately determining the output signal 
in Fig. 3 (sa is the approximation to s2). 


with the same ‘small-signal transmission” and the same output stage 
as the feedback system in Fig. 3, then the output distortion in Fig. 3 is 
approximately (I — C)~‘u. In engineering terms, feedback is said’’*’ to 
reduce the nonlinear distortion by the amount of the “return difference” 
[1 — C(w)] fie., by the formal frequency domain representation of the 
operator (I — C)]. 


6.2 Comparison with Desoer’s Results 


In an interesting paper’ Desoer has considered the range of validity 
of the graph in Fig. 4 for the case in which ¢(v,f) = v + ev”, where e 
and m are real constants with m an odd positive integer. In his discussion 
all signals belong to K(2) with = the null set. He considers the analysis 
of a feedback system of the type shown in Fig. 2(a) and argues that 
if the norm of the input to the amplifier is sufficiently small and if 
| e | is sufficiently small,f then the distortion component of the output 
~ * The “distortion generator’’ referred to in the usual engineering arguments 
produces the signal u(t). 


+ This writer feels that some additional restriction on | « | is necessary in order 
that Desoer’s condition (B) be satisfied. It would suffice to assume that | e€ | < 3/16. 
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signal is given by the sum of an infinite series in which the norm of 
each term is less than one-fourth the norm of the preceding term. When 
the system is characterized by a flow graph of the type shown in Fig. 3 
[as indicated earlier, the analysis of the seemingly more complicated 
situation in Fig. 2(a) can be reduced at once to a consideration of this 
type of graph], the first term in the series is the distortion determined 
from Tig. 4. 

Desoer focuses attention on simplifications that can be exploited in 
cases of engineering interest. According to him it is a matter of experi- 
ence that for a typical low-pass feedback amplifier |] C ||. = —C(0) 
[observe that in general || C ||.. = sup | C(w) |]. In addition he presents 


a, heuristic argument to support the claim that in such amplifiers || (I — 
C) ||,, is approximately equal to 2. With 


ee [ ila faa OA eS: 


the condition that r [in (13)] be less than 14 (i.e., the condition cor- 
responding to Desoer’s criterion for determining the applicability of the 
graph in Fig. 4) is 


Steet 
BIC |. 


It is a routine matter to show that in the high loop-gain case (i.c., 
the case of principal engineering interest) (14) is a much less stringent 
condition on the permitted degree of nonlinearity than that implied by 
Desoer’s upper bound on | ¢ | and his input norm bound. For a loop gain 
of 100 fie., C(O) = —100] and m = 3, the bound on | e | is such that 
(14) permits any deviation from unity of the slope of y(v) [i.e., max 
(|1 — al, |B — 1|)] which does not exceed 2,500 times the permitted 
maximum deviation from unity of the slope of v + ev° over the operat- 
ing range*} implied by Desoer’s input norm bound. 


max (|1 — a@|,|8— 1 |) < (14) 


6.3 An Extension of Theorem II 


Note that when the loop gain is large the permitted amount of non- 
linearity in (14) is quite small. Although it is not clear whether the 
range of validity of the graph in Fig. 4 can be substantially increased, 
it is a simple matter to show that the iterates s2, defined in Theorem II 

* In the notation of Ref. 1, the signal input to the nonlinear element is z — u6f 
and ||[¢ || S$ #iloill Ss lel 


Desoer’s bound on | e | can be considerably improved in the large loop-gain 
case by assuming that ¢ lies within a ball of much smaller radius (such as || uw ||~!). 
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converge to the unique s. e K(2) that satisfies so = ¢g[Cs. + s,| even if 
r = 1, provided that 


|| C(I — ©)" |]. max ({1 — a], |8—1]) <1. 


This follows from the contraction-mapping fixed-point theorem and 
the fact that the relation between s; and s. can be written as s; = s, + 
g[C(I — C)~'s; + s,] where s3; = (I — C)so. 

In the next section we consider an alternative, but closely related, cas- 
cade graph for determining the output distortion in Fig. 3. For a given 
loop gain (assuming it is large) the alternative graph is valid for much 
larger amounts of nonlinearity than that indicated in (13) or (14). 


VII. ADDITIONAL RESULTS RELATING TO THE EQUATION So = g[Cso + 3)] 


In this section we assume that there exists a function y such that 
Y[e(x)] = x for all real x and ¢. Hence ¥[s2] = Cse + s,. Specifically 
Definition: Let p(x,t) = x + P(2,t) be a real-valued function of the real 
variables x and t such that ~(0,t) = 0 for all t, and that for all t and all 
e2y 

v(x — y) S ¥(2,t) — (yt) S o(@ — y) 


where y and o are real constants. 
Theorem IV: Let p|f| be continuous with respect to t on the complement of 
= whenever fe K(Z). Let c(t) eLir, C(w) ¥ 1, and suppose that 


| (I— C)~ Jo max (JL -—y|,Je-1))=¢@<1. 


Then for any s, € K(Z), there exists a unique s. € K(Z) such that p[so] = 
Cs. + s,. In fact, s. = lim Sen where 


Son = —(I ~ C) “P[Sxn-y] + (I~ C) a 
and Sx) ts an arbitrary element of K(Z). The nth approximation Se, satis- 
jies 
|| 82 — San lle S = || Sex — Su [lo « 


The analogous result for the space Loz 1s 
Theorem V: Let c(t) € Lor, sup |C(w)| < ©, inf |1 — C(w)| > 0, 


and suppose that 
(I — C)“ 2 max ([1—y|,Je-1))=a<1. 
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Then the conclusion of Theorem IV follows with K(Z) replaced with Lor 
and with the K(Z) norm replaced with the Lr norm. 
7.1 Proof of Theorem IV 


In view of the first part of Theorem I and the assumption that C(w) ¥ 
1, the equation yY[s.] = Cse + s,; can be written as sy = Ms., where M 
is the mapping of (2) into itself defined by 


Ms, = —(I — C)"P{s] + (I — C)7’s,. 


Thus, Theorem IV follows from the contraction-mapping fixed-point 
theorem and the readily verified fact that 


|Mf—Mg |. allf—glle, a<i 
for any f,g © K(Z). With obvious modifications this argument suffices 
to establish Theorem V. 
7.2 A Cascade Graph for Approximately Determining the Signal se in 
Fig. 3 when || C ||.. Is Large 
As in the discussion of Theorem II, if we set 3 = (I — C)~’s, in 


Theorem IV, 


fo a] 
8S. = S29 + 2d [Socj41) — Soy], 
j= 


ive] 
in which >> (89341) — 32;] represents the nonlinear distortion component 
j=0 


of the output signal in Fig. 3. The first term in this series is 


(8 — 5) = —(I — C)“¥[(I — C) si], (15) 
and the error incurred in ignoring the remainder of the series 
(ic. (4 — 5) — dS [kaw — Bil is, as in Section VI, at most 
j=0 oe) 














q(1 — q) times the norm of the first term [(i.e., of (15)]. 
Thus if 
q = || (I~— C)™ |]. max ({1 — y|,|o — 1]) (16) 


is sufficiently small, (8: — S29) is a good approximation to the distortion 
component of so. Fig. 5 shows the corresponding flow graph* for de- 
termining Sq, the approximation to the output signal s. in Fig. 3. 


* An analogous interpretation of Theorem V leads to the same flow graph. 
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Fig. 5 — Alternative cascade flow graph for approximately determining the 
output signal in Fig. 3 (8 is the approximation to 82). 


Observe that this graph differs* from the one in Fig. 4 only in that @ 
is replaced with —y~. However the expression for g in (16), unlike the 
corresponding expression for r in Theorem II, does not contain the 
factor || C ||., which is a measure of the amount of feedback present 
in the system (as indicated earlier, in low-pass feedback amplifiers 
typically || C ||. = |C(0)]). 

The condition that ¢ in (16) be less than 14 when |] (I — C) "|. = 
2 (i.e., the condition corresponding to Desoer’s practical criterion for 
determining the applicability of the graph in Tig. 4) is clearlyt 


max ({1 —y|,|¢—1]|) < 1/8. 
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APPENDIX 


The following inequality, in which 9t denotes either K(Z) or Lex, can 
be used in some cases to compare the output signals in Figs. 4 and 5 





| Qal/] + VIII] S$ HS | OI I (17) 


where f is an arbitrary element of 9, || - || denotes the norm for the 
space 9t, and Q is any linear operator defined on 9% such that 


HQ] - |} Q* max (Jl —yv],J/o-—1]) =p <1. 


* In the Appendix a general inequality is presented which in some cases can be 
used to bound the norm of the difference between the two approximations to the 
distortion component of se , (821 — S20) in Fig. 4 and (821 — S20) in Fig. 5, in terms 
of the norm of (8:1 — 520). The inequality is not applicable when || C ||,, is large 
unless max (| 1 — y|,|o¢— 1] ) is sufficiently small. 

+ It is sometimes desirable to consider the unrealistic and very much simpler 
situation in Fig. 8 when C represents multiplication by a real constant c. In that 
case the relation between s; and sz can be written as so = c71y[se] — c™1s, , from 
which it is evident that, provided | c | is sufficiently large, the contraction-map- 
ping fixed-point theorem is applicable with a small contraction constant even 
when y represents a highly nonlinear element. 
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Inequality (17) follows from the fact that g = Qgf[f] is the fixed point 
of the contraction-mapping N (with contraction constant p) defined by 


TA! 

Ng = Qf — QYIQ gl. 
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A Three-Conductor Elementary Clogston 
Coaxial Transmission Line— Calculation, 
Fabrication and Experiment 


By J. M. MANLEY 
(Manuscript received April 26, 1963) 


Eddy current losses tr a conductor of a transmission line can be reduced 
af the conductor can be divided into two parallel shells and the current divided 
evenly between these two. According to the Clogston theory, this may be 
achieved if the dielectric constant of the space between these two shells is less 
than that of the lines’ main dielectric by the right amount. 

This paper describes the calculation of the properties of a three-conductor 
Clogsion line, zits fabrication, the measurement of its allenuation, and addi- 
tional calculations relating to the termination of the line. 

The calculations indicate that with a line having this structure attenua- 
tion can be reduced 24 per cent at 4. me and about 15 per cent at 1 mc and 
10 me from that of a conventional coaxial line having the same dimensions 
as the one studied here. A reduction of 18 per cent instead of 24 per cent was 
realized in the 600-foot length of line which was built. About one-half of 
this difference 1s a result of the deal ratio of dielectric constants not having 
been attained in this first attempt. 

It was found that ideal termination of the line is difficult. However, a 
simple approximation to an rdeal termination yields an insertion loss for 
the line only a small amount larger than the ideal, provided the length of line 
is such that tts loss is about 20 db. 


I. INTRODUCTION 


A three-conductor coaxial cable, in which the locations of the two 
inner conductors are transposed at regular intervals, was built and shown 
to have about 20 per cent lower attenuation over a certain frequency 
interval than a two-conductor coaxial of the same size.! This is because 
the eddy current losses are smaller, and this in turn is a result of the 
normal central conductor having been divided into two and the normal 
central current having been divided nearly evenly between these two. 
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It is possible to do this so that the two conductors occupy approximately 
the same space as the single one did before, since the skin effect causes 
most of the current to flow in a thin shell at the outer periphery of the 
inner conductor over a certain frequency interval. Dividing the central 
conductor provides two shells for current flow. 

One way to achieve equal currents in the two inner shells is to trans- 
pose the two inner conductors frequently enough in the manner of litz 
wire and so force the even division. This is the method used in the project 
described in Ref. 1. Another method is to so choose the dimensions and 
properties of the two dielectric spaces that the line has a natural mode 
of propagation with approximately the desired current distribution. This 
is a special case of the method suggested by Clogston a number of years 
ago,? namely, that the conductors of a transmission line be made of many 
laminations with proper dielectric separation. Other work has also been 
done on the three-conductor case.’ It is also possible to obtain the de- 
sired current distribution by having the inductances associated with the 
several spaces in the proper ratio. This has been done in experimental 
lines built in Japan.4 The proper values of inductance are achieved by 
having spiral gaps of increasing pitch as the center of the cable is ap- 
proached. 

The Clogston method which is used in the experiment described in this 
paper depends on having the proper ratio of dielectric constants in the 
several spaces. Fig. 1 is a pictorial representation of the Clogston line 
built for our experiment. This line is uniform with respect to length, in 
contrast with the transposed line. Fig. 2 is an enlarged photograph of 
the central member of the cable. 

Following Clogston’s work on laminated conductors for transmission 
lines,? an experimental line having 100 laminations was built.° Measure- 
ments on this cable did not show the expected reduction of attenuation.§ 


COPPER 

COPPER 
CONDUCTOR 3 CONDUCTOR 2 
>... COPP 
CONDUCTOR 1 











rene 
hs WITH AIR SPACE 
) DIELECTRIC 4 
POLYETHYLENE 
DIELECTRIC 2 


Fig. 1 — Drawing showing construction of three-conductor Clogston trans- 
mission line. 
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Fig. 2 — Enlarged photograph of two-conductor central member of Clogston 
line. 


It was suspected that nonuniformity in spacing and thickness of lamina- 
tions had a part in the discrepancy, and some calculations of this were 
made.® But the discrepancy was not resolved until the work’ by Gordon 
Raisbeck was carried out. Meanwhile, it was decided to build the 
simplest laminated line, namely one in which the central member con- 
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sisted of two conductors instead of one. The first result of this program 
was the transposed line.1 The next step was the three-conductor Clog- 
ston line project reported in the present paper. 


II, CALCULATION OF THE PROPERTIES OF THE THREE-CONDUCTOR CO- 
AXIAL LINE 


The transmission properties of a three-conductor coaxial structure 
were derived in the paper of Ref. 1 by adapting Schelkunoff’s general 
results’ to this case. Only the necessary results will be repeated here. 

Sections of the line are shown in Fig. 3. This illustration and the nota- 


Tae by 


(i, +2) 








Fig. 3 — The three-conductor Clogston line: cross section and schematic longi- 
tudinal section showing surface impedances. 


tion used here are the same as in Ref. 1. The conductors are numbered 
1, 2, 3 beginning at the innermost. The currents flowing in these con- 
ductors and the voltages on them are designated 2 , 72, 73 3 U1, Ve, Vs in 
same order. It is seen that 


—%3 => ml + ty and 
v3 = 0. 


The behavior of the line is described by the two homogeneous equa- 
tions 


[Zu — ¥/Vila — Za(ti + 2) = 0 


; ae (1) 
—Zavli + [Zoe — ¥/Vel(t1 + t2) = 0 
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and two others 


ve — wy = —yh/N (9) 
bo = y(t + t2)/Ve2 
where 
Zu = Za? + Loa + 5Xe1 (3) 


is the effective series impedance of the simple coaxial line consisting of 
conductors 1 and 2 only and where 


Zo = Vs + Vie + aX ins (4) 


is the effective series impedance of the simple coaxial line consisting of 
conductors 2 and 3 only. In these expressions Z,°”” and Zya°”” are the 
outer and inner surface impedances, respectively, of conductor m, using 
Schelkunoff’s concept and notation described in Ref. 8. The surface 
impedances are fictitious coefficients which gather the impedance effects 
of a conductor at its surfaces. Za,” is the surface transfer impedance of 
conductor 2; y is the propagation constant. Also 


jXa, = ju 5 log (a2/b1) 


Qrweyz 


ae log (a2/b1) 


and 
. A Ww 
jXa, = jus log (as/b2) 
Tv 


2m WEQ 


* I Tog (as/b2) 


are the series impedances and shunt admittances of the two dielectric 
spaces between conductors 1 and 2 and between conductors 2 and 38, 
respectively. All these coefficients are per unit length of line. 

The propagation of voltages and currents in the natural modes of the 
line are governed by the four values of y which alone permit the two 
homogeneous equations (1) to be satisfied. These values of y are roots 
of the characteristic equation 


Y (5) 


Hy Saw VoFe) Vio Fut Ga y= 0 (6) 


obtained by setting the determinant of coefficients of the homogeneous 
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equations equal to zero. The roots occur in pairs, so that we have 


| ios 64 
Y Y (7) 
Ys = —~Y4- 

Tor the dimensions considered here, y: and y3 are associated with 
low- and high-loss modes respectively. 

If it is assumed a priori that the currents 2, and 72 in the two lamina- 
tions are equal, and if the dimensions and impedances of the three- 
conductor cable used for the transposed line are taken, then it is found 
that attenuation of the low-loss mode is reduced about 30 per cent below 
that of a comparable two-conductor coaxial. However, these assump- 
tions require the ratio of dielectric constants to be complex, viz. 


ép/é = 1.23 — 7 0.08 (8) 


which implies dissipation in at least one of the spaces. This would 
introduce additional loss which has not been considered. 

Instead of doing further analytical work along these lines, we resorted 
to numerical computation of the line properties, using again the di- 
mensions of the uniform three-conductor cable made for the transposed 
line (OD about 0.18 inch), but with real values for the ratio «/e. 
The values of attenuation constant versus ratio «/¢, of dielectric con- 
stants for several values of spacing between conductors 1 and 2 found in 
these calculations are plotted in Fig. 4. With these real values for the 
ratio ¢/€2 , there is an appreciable (about 24 per cent) reduction of at- 
tenuation at the minimum. Near this minimum, the ratio of currents in 
conductors 1 and 2 is nearer 1.2, as may be seen in I’ig. 5, where the ratio 
of voltages is plotted also. It will be noticed in Fig. 4 that the curve of 
attenuation versus «/¢: is considerably sharper where the lamination 
spacing is 15 mils instead of 5 mils. In Fig. 6 the attenuation constants 
a, and a3 of the low- and high-loss modes, respectively, and also that of 
a reference two-conductor coaxial, are plotted versus frequency. This 
reference coaxial has the same size outer conductor as that of the special 
cable and a solid inner conductor with an OD very nearly the same as 
that of conductor 2; the dielectric is specified by e.. The shape of the 
x, curve is similar to that for the transposed line, except that attenuation 
at low frequencies approaches that of the reference two-conductor co- 
axial, whereas attenuation in the transposed line was somewhat higher. 
The phase distortion of the three-conductor Clogston and the two- 
conductor reference line are plotted in Fig. 7. For further comparison of 
the Clogston line, attenuation and phase distortion of the two-conductor 
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Fig. 4 — Calculated attenuation of three-conductor Clogston line. 


coaxial consisting of conductors 2 and 3 — that is, two concentric 
tubes — are shown in Figs. 6 and 7, respectively. 

Other properties of the line are shown in Figs. 8, 9, 10 and 11. Z; and 
Zo are the characteristic impedances between conductors 1 and 3 and 
between 2 and 3, respectively, for the low-loss mode with propagation 
constant 71. For the high-loss mode, the corresponding impedances are 
2, and z.. These are the ratios of voltage and current amplitude for each 
of the natural traveling waves which can appear on the line. This may 
be seen clearly when the total line currents and voltages are expressed 
in terms of the natural traveling waves which are the solutions of the 
line equations (1) 


t(%) = tine” + tye 7 + ige”* + tne 
12(2) = dye" + Tooe *'* =- 4320” + ie 
v(x) = ve + ve 7 + ge" + vge 7 


v2(x) = v.67" + vee ™* + vg9e7” + vgoe 78”, 


(9) 
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Fig. 5 — Calculated current and voltage ratios in three-conductor Clogston 
line. 


The first subscript refers to the mode and the second to the conductor. 
For example, 7; is the current associated with wave of mode 1 in con- 
ductor 1, and 2 is that for the wave of mode 2 in conductor 1. Mode 2, 
as noted in (7), is the backward traveling wave with propagation con- 
stant — yi. Thus, modes 1 and 2 are the forward and backward waves 
having low attenuation y,; and modes 3 and 4 are the forward and 
backward waves having high attenuation +3. From these facts, we have 
the following relations 
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VL Vat 
ZA a — 
tu tar 
V12 V22 
Z= = 
412 222 
031 V41 
21 — _ FS er 
431 tar 
(10) 
032 V4 
£2 See ee, es 
232 142 
Vio V22 
hoe = 
Vit Vo 
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Fig. 6 — Attenuation constants of three-conductor Clogston line compared to 
those of two two-conductor lines of the same dimensions. 
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Fig. 7 — Phase distortion of three-conductor Clogston line compared with 
those of two-conductor coaxials of same dimensions. 


The natural ratios of voltages on the two conductors for any one travel- 
ing wave are not the same as the ratios of impedances, e.g., 04/02. 4 
Z:/Z,. The voltage and current ratios were calculated by substituting 
the values of 71 and 73 into (1) and (2). They are plotted in Figs. 9 and 
11. From these figures it is seen that the current associated with the 
high-loss mode (propagation constant 3) in conductor 2 is nearly equal 
to that in conductor 1 but flows in the opposite direction, so that very 
little flows in the outer conductor 3. Thus, the current that flows in the 
traveling waves of this mode does not divide more or less evenly between 
the two inner conductors as it flows down the line, as required for re- 
duction of eddy current losses, but uses these two conductors mainly 
as go and return paths. In other words, looking at this Clogston line as a 
two-conductor structure, current in this mode is essentially a circulating 
current in one of the conductors. It is also seen that below about 2 me 
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the power at this mode has a fairly large reactive component. The high- 
loss mode is thus undesirable for other reasons than its order-of-magni- 
tude higher ioss. 

The relations and figures of this section will be referred to again in 
Section IV when the subjects of termination and measurement are taken 
up. 


II. MAKING THE THREE-CONDUCTOR CABLE 


After considering a number of ideas and making several trials, it was 
decided to form the inner dielectric by spiraling a polyethylene thread 
around the inner solid wire as shown in Fig. 2. This method, suggested 
by Gordon Raisbeck, seemed to be the most suitable for our experiment. 
If the thread has a circular cross section, calculations showed that the 
desired effective dielectric constant is achieved if the threads nearly 
touch. 
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Fig. 8 — Components of characteristic impedances of conductors 1 and 2 (with 
conductor 3 as return) of three-conductor Clogston line for low-loss mode. 
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Fig. 9 — Current and voltage ratios in three-conductor Clogston line for low- 
loss mode. 


One problem here was the effect on this insulation of the heat neces- 
sarily present during the extrusion of the main dielectric over the 
composite consisting of conductors 1 and 2 and the dielectric between 
them. This problem was solved by a modification of the normal extrusion 
process worked out by members of the chemical research laboratory who 
did this work for us on their experimental extruder. 

The two-conductor composite central member of the cable was made 
in two trips through a small-scale cable making machine. In the first, 
5-mil polyethylene thread was wound around a 19-gauge copper wire as 
it moved through the machine. In the second trip, 1-mil copper tape and 
the insulated wire were pulled through a forming die at the center of 
the spinner so that the copper was wrapped around the insulated wire, 
leaving a longitudinal gap about 4 mils wide. The copper was held in 
place by a spiral wrapping of 2-mil thick polyethylene tape. The correct 
spacing of thread was determined by measuring capacitance and calcu- 
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lating dielectric constant for several pieces a few feet long. A linear puller 
was used instead of a circular capstan to avoid bending this composite 
conductor. The puller consisted of two rubber-toothed belts and a 
suitable mounting and speed control. 

The copper tape used for conductor 2 was inspected and cleaned be- 
tween Teflon blocks. Even so, a few tiny bits of copper came off and 
lodged between turns of thread. This was detected by an alarm system 
rigged to give an indication as soon as a low resistance developed between 
conductors 1 and 2 during the forming process, so that the machine could 
be stopped and the trouble sought and removed. 

An enlarged view (about 8X) of the composite two-conductor center 
is shown in Fig. 2. The finished form of the structure is at the right, 
where the polyethylene thread may be seen through the gap between 
the edges of the copper tape. To the left of this, where the unwrapped 
polyethylene tape no longer binds it, the gap in the copper tape is 
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Fig. 10 — Components of characteristic impedances of conductors 1 and 2 (with 
conductor 3 as return) of three-conductor Clogston line for high-loss mode. 
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Fig. 11 — Current and voltage ratios in three-conductor Clogston line for high- 
loss mode. 


larger. Small pieces of pressure-sensitive fabric tape have been used to 
keep the polyethylene thread and tape from unraveling. 

Calculations of the dielectric constant of the space between conductors 
1 and 2 of this composite were made from capacitance and dimensional 
measurements; they indicated a value of 0.76 for «/e2 instead of the 
intended 0.81. The principal reason for the difference appears to be in- 
sufficient control of thread spacing and copper tape binding tightness 
throughout the whole length of the conductors. Weighed against the 
difficulties of, and the time required for, making a new central element, 
the difference between attenuation for 0.76 and that for 0.81 seemed 
small enough to warrant going ahead with the experiment. 

After some preliminary runs, the full length of the central member 
(about 600 feet) was put through the chemistry laboratory’s extruder to 


THREE-CONDUCTOR CLOGSTON LINE 2565 


make the insulation between conductors 2 and 3 of the cable. This was 
not entirely successful, since examination showed that the outside diame- 
ter of the polyethylene, which was supposed to be 170 mils, actually 
varied from 150 to 190 mils. After due consideration it was concluded 
that this was an unavoidable consequence of the modified extrusion 
process. In order to avoid the uncertain effects of this diameter variation, 
a machine was made which shaved off excessive polyethylene as the line 
was pulled along so that the outside diameter now is 151 mils within 
less than 1 mil. Examination of the inner part of the line at this point 
showed that no polyethylene had been forced into the space between 
conductors 1 and 2 during the extrusion process. 

The outer conductor of the cable was made by forming 4-mil copper 
tape over the extruded polyethylene. This conductor is held closed with 
a small overlap along the longitudinal seam by a spiral wrapping of 
pressure-sensitive fabric tape. 

The length of the cable was carefully measured to be 539.8 feet, or 
164.3 meters. The dimensions of the composite structure — conductors 
1 and 2 — are difficult to obtain accurately. The best estimates of these 
dimensions of the line as built, using the terminology of Tig. 3, are (in 
inches) 


2a. = 0.0466 2a3 = 0.1515 


2b. = 0.049 2b3 = 0.1595. 


Using these dimensions and the measured values of capacitance of the 
two dielectrics, we find that 


érg = 2.2, én = 1.67, e/e2 = 0.76. 


IV. TERMINATION 


Because the answers to the question ‘‘“How do we terminate the 
Clogston line?” are complicated, let us try to see first just what is meant 
by proper termination. 

If a two-conductor line with its two natural traveling waves, or modes, 
is terminated with the characteristic impedance for the forward mode 
(i.e. Z; = vy /%1), only the forward traveling wave exists on the line and 
the attenuation constant of the forward natural wave is the attenuation 
constant of the transmission line. If the termination. is other than Z, , 
some of the backward natural wave is also present, a situation which 
may be described otherwise as involving reflection or nonuniform flow 
of energy. 

In the three-conductor line, there are four possible natural traveling 
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waves, and the total currents and voltages on the line are made up of 
various amounts of these, as indicated by (9). Consideration of these 
equations along with (10) shows that if the inner conductors 1 and 2 are 
terminated with the characteristic impedances Z; and Z2 respectively of 
the low-loss forward traveling waves and the voltages on these two con- 
ductors established in the ratio natural for these waves, then the total 
currents and voltages in the line will consist of these waves alone. The 
attenuation of the line will then be that of the low-loss modes, as shown 
in Figs. 4 and 6. This is the situation we desire, but which was found 
difficult to achieve experimentally. 

In the case of the transposed line, the voltages between the two inner 
conductors and the outer are the same for the low-loss mode and opposite 
in sign for the high-loss mode. Thus, the proper way to launch and 
terminate the line for propagation at the low-loss mode while eliminating 
the high-loss mode is also the simplest procedure: i.e., to connect together 
the two inner conductors at both ends of the line. 

The situation is not as simple for the elementary Clogston line. The 
results of calculations plotted on Figs. 8, 9, 10 and 11 show that the 
voltages of the two inner conductors with respect to the outer are not 
the same for the low-loss traveling wave. Further, they show that the 
ratio of these voltages is not the same as the ratio of the characteristic 
impedances of the two inner conductors when operating in the low-loss 
mode. 

Terminating the Clogston line by connecting together conductors 1 
and 2 at the ends and using an average characteristic impedance intro- 
duces an unknown proportion of current at the high-loss mode; but the 
procedure is simple, so it was tried. The attenuation at 1 me was about 
50 per cent higher than expected and about equal to the expected value 
around 10 me. Further, there were bumps in the attenuation curve at 
the low-frequency end, indicating the presence of reflections or some 
other combination of natural modes. This method is taken up again in 
the next section, where the results of a search for terminating methods 
for practical use of the line are given. 

The present purpose, however, is to see if the line performs as pre- 
dicted by theory, and since the most interesting aspect of the theory is 
the low-loss mode, we tried to establish transmission via this mode alone. 

Finally, the method used to drive and terminate the Clogston line 
shown in Fig. 12 was arrived at. It will be noticed that at the receiving 
end of the line the two inner conductors are terminated separately. The 
two-terminal ends of the networks could not be paralleled at both send- 
ing and receiving ends of the line without the use of some sort of hybrid 
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Fig. 12 — Circuit arrangement for terminating and measuring attenuation of 
three-conductor Clogston line. 


or bridge structure; otherwise a transmission loop is formed which makes 
it very difficult if not out of the question to satisfy all the terminating 
requirements. 

This arrangement, while measuring the attenuation through conduc- 
tors 1 and 2 separately (which of course should be the same), was ade- 
quate for the purpose of testing line performance. 

The T and LZ networks of Fig. 12 were designed to match the im- 
pedances Z, and Z, of the line and to supply the line input so that the 
ratio of voltages on conductors 1 and 2 was 2:/v12. No attempt was 
made to obtain a design which would provide a match over the fre- 
quency band of interest following the data of Figs. 8 and 9. Instead, the 
networks were made resistive and a separate set was made for each of 
about eight frequencies. Also, values for Z,, Ze and v/v;2 modified 
from those shown in Figs. 8 and 9 were used. The latter values were for 
the expected outer diameter (170 mils) of the cable. When the uneven 
outer polyethylene had to be shaved down to 151 mils in order to obtain 
a uniform OD, a complete new calculation was not made; only the 
change in a few parameters was found, and this used as a basis for 
estimating the others. The impedances were verified by using a pulse 
signal and observing for reflections as terminations were varied. 

Using launching and terminating networks designed in this way and 
connected to the line as shown in Fig. 12, the attenuation in both inner 
conductors was measured at several frequencies surrounding the design 
frequency. The quantity actually measured was the insertion loss of the 
line, using a precise transmission measuring set. Some of the data are 
plotted in Fig. 18, where it may be seen that the two curves of each set 
intersect near the network design frequency. They cross and diverge in 
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Tig. 13 — Attenuation measurements of three-conductor Clogston line with 
arrangement of Fig. 12. 


both directions beyond the design frequency because the networks pro- 
vide launching properties only at this frequency. However, they do give 
the proper value of attenuation in the line at this point. The attenuation 
at each of the intersections is plotted against frequency in Tig. 14 along 
with calculated attenuations for the elementary Clogston line and for 
the reference two-conductor line. The calculated values were obtained 
by adjusting those shown in T’ig. 6 as described previously. 

The curves of Fig. 14 show general agreement between measured and 
predicted attenuation of the Clogston line. Near 4 me the reduction of 
attenuation is about 18 per cent instead of the predicted 24 per cent. 
Part of the reason for this is that, as pointed out in Section III, the ratio 
of dielectric constants is 0.76 and not 0.815, as required for a minimum. 
This reduces the predicted change to 21 per cent. 

It seems reasonable to conclude that an elementary Clogston line con- 
sisting of three coaxial conductors can be built to have a ratio of di- 
electric constants reasonably near the minimum point and to have an 
attenuation close to that predicted. 


V. FURTHER STUDY OF THE TERMINATING PROBLEM 


The method of termination just described would, of course, be com- 
pletely unsuitable for practical operation of the line. For this reason, a 
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theoretical study of the problem of coupling the three-conductor line to 
two-terminal source and load in a practical operation of the line was 
made by Gordon Raisbeck. This unpublished work led him to a recon- 
sideration of transmission properties of the line when the two inner 
conductors are connected together at the ends of the line. 

Briefly, his conclusions are that for a line long enough so that its total 
loss is several nepers, the attenuation is very close to that calculated for 
the low-loss mode of the properly terminated line. Also, the characteristic 
impedance of the line approaches independence of length as length in- 
creases and varies only a small amount with frequency. These facts show 
that for a reasonable length of line the predicted lower attenuation may 
be achieved with a simple resistance termination. 

Using his formulas, the insertion gain of the 164-meter line when con- 
nected between two 50-ohm resistances was computed. This is in fairly 
good agreement with the corresponding measured loss shown in Fig. 15 
when the difference in outside diameters is taken into account. Evidently 
this is too small a length for the method to be suitable. 

Computation from the same formulas of the insertion loss of a 1600- 
meter length of line (total loss about 2 nepers at 4 mc) was made also 
and is plotted in nepers per meter in Fig. 16 along with attenuation of 
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Fig. 14 — Attenuation of three-conductor Clogston transmission line compared 
with a two-conductor line of like dimensions. 
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the low-loss mode alone from Fig. 6. This shows that a line of this length 
with simple termination does have attenuation very close to that of the 
properly terminated line operating in the low-loss mode. The effects of 
imperfect launching and impedance matching for the desired mode and 
the presence of some current in the high-loss mode are thus seen to be 
small and to occur mainly at the low frequencies. While there is no 1600- 
meter length of real line available for measurement, the calculated results 
appear to be reasonable. 

A brief resumé of Raisbeck’s analysis follows. The method used by 
him in this work was to start with a length 1 of the six-terminal, three- 
conductor line whose transmission properties were calculated and 
described in Section II and then to short together conductors 1 and 2 at 
both ends. The matrix describing this new four-terminal structure was 
then worked out and its transmission properties calculated in terms of 
two possible traveling waves and characteristic impedances or, in 
mathematical language, the eigenvalues and eigenvectors of the matrix. 
The derivation of the matrix of the new line and the calculation of its 
properties are fairly involved and will not be repeated here. 
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Fig. 15 — Attenuation of short experimental three-conductor Clogston line 
with simplified termination. 
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Fig. 16 — Attenuation and characteristic impedance of relatively long three- 
conductor Clogston line with simplified termination. 


The two traveling waves are described by the roots \, and A, * of the 
characteristic equation of the new matrix. These roots are given by 


_la 1 a 
waded y/B 4 (11) 


and the characteristic impedance is 


_ =he/a 
"A/T — 4B/a?? 


where the definitions 
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b = (As — As )(— 1+) (2 +h) 
= Oy arya tay (E+) 
C= Oro DOr ss UC = a) 


relate the parameters of the four-terminal line to those of the six-terminal 
line given in (10). In these expressions 

yee A = et he = er (14) 
all the propagation constants y having negative real parts. 

After studying these relations, Raisbeck observed that for a line long 
enough so that the attenuation was several nepers, A, is nearly equal to 
1; Le., the attenuation of the line with conductors 1 and 2 shorted at 
the ends is approximately the same as that of the low-loss mode in the 


properly terminated three-conductor line. 
To see how this comes about, let / be large enough so that 
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The quantity d does not depend on length of line, but on the voltage 
ratios and impedances of the natural traveling waves of the line. It is 
small, being 0.1 at 1 mc, 0.056 at 4 mc, and 0.016 at 10 me, using the 
data of Figs. 8, 9, 10 and 11. Thus a/b is large, and we have very nearly 


and 
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(1 + d) (18) 
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or 
ev’ = e™(1 + d). 


This shows that the attenuation of the line having conductors 1 and 2 
shorted at the ends of a sufficiently long section is nearly the same as 
that of the ideally terminated line. The characteristic impedance of the 
line with conductors 1 and 2 shorted is given by 


Gf, ZiZ> hy -— hs 
Ge (iz) (Fp) +o. oe 


This expression for Z,; is nearly equal to the impedance of Z, and Ze» in 
parallel. 

While the values of a, and Z, plotted in Fig. 16 were calculated from 
the exact values of a, b, and c, the calculations show that for the 1600- 
meter length, the above approximations are quite close to the correct 
values. 

The formula for insertion gain between two resistances /?, as used in 
the calculations, is 


ART, 
1(Z, + BR)? —(Z, — BR) 


2 AR/Z. on 
~ (1+ R/Z.)? 


a 
(20) 


VI. SUMMARY 


Early in the paper, the properties of an elementary Clogston line were 
discussed, and the process of building such a line for experiment de- 
scribed. In testing the line’s performance, it was found that it performs 
about as predicted and that terminating the line ideally is a difficult 
problem. In Section V, it has been seen that Raisbeck’s analysis leads to 
a simple way of coupling the line to source and load with resistances 
which, while it does not provide ideal termination, does allow per- 
formance which is quite close to ideal when the length of line section 
involved has a loss of around 20 db or more. 

Thus we conclude that it is possible to build a uniform three-conductor 
line which has about 20 per cent less attenuation than a comparable 
two-conductor line over a considerable frequency band. Since this is 
almost the same as the reduction obtained with the transposed line, this 
line is an improvement in not requiring transpositions. 

While a line made of two concentric tubular conductors may be the 
best for pulse transmission because of its low phase distortion, the use 
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of this three-conductor Clogston line may be thought of as a good com- 
promise. This is because it provides less phase distortion than a con- 
ventional two-conductor line, although more than the tubular line (see 
Tig. 7), and in addition its solid center conductor permits a low-loss 
path for sending power down the line, and lowers the transmission loss 
considerably in the low-frequency range (see I’ig. 6). 
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Linear Time-Varying Circuits— Matrix 
Manipulations, Power Relations, and 
Some Bounds on Stability 


By SIDNEY DARLINGTON 
(Manuscript received May 17, 1963) 


This paper is concerned with general circuits of linear, time-varying, 
positive, two-terminal components. It describes methods of manipulating 
corresponding matrix (vector) differential equations. It uses the manipula- 
icons to derive equations for power and bounds on stability. The bounds 
apply to the exponential factors associated with the basis functions of 
periodically varying circuits. 


I. INTRODUCTION 


Linear time-varying circuits of special kinds have been designed and 
analyzed with notable success. On the other hand, theoretical techniques 
suitable for more general linear time-varying circuits have been develop- 
ing much more slowly. The development of more general techniques can 
be approached in various ways. One can seek to specialize the pure 
mathematics of linear differential equations, in order to discover the 
properties of those equations which can actually correspond to physical 
circuits. Alternatively, one can seek to apply the classical analysis of 
general dynamical systems. As still another alternative, one can seek to 
generalize, for time-varying circuits, concepts, principles, and techniques 
which have long been applied to fixed circuits. 

This paper illustrates the circuit theory approach. After formulating 
matrix (vector) differential equations corresponding to circuits of linear, 
time-varying, two-terminal components, it describes some general meth- 
ods of manipulation. These apply to combinations of time-varying 
matrices and the differentiation operator, and are time-varying counter- 
parts of manipulations applied to constant matrices in the theory of 
fixed circuits. Thereafter, the paper uses the manipulations to derive 
formulas for power, and some bounds on stability. 

The power equations are conventional and reflect the well known fact 
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that time-varying capacitors and inductors can supply energy to a 
circuit. However, they can be derived in a way which illustrates manipu- 
lation of time-varying matrices in particularly simple terms. Further- 
more, the form of the power equations suggests a starting point which 
leads eventually to the bounds on stability. 

It is assumed throughout the paper that all circuit components (fixed 
or varying) are positive. The stability bounds (as derived) assume that 
the circuits vary in a periodic manner. Then the basis functions can be 
arranged as a set of exponentials, each multiplied by a periodically vary- 
ing coefficient (except in singular cases which are the time-varying 
counterparts of fixed circuits whose frequency functions have multiple 
poles). The basis functions are the counterparts of the ‘natural modes” 
of fixed networks, and they play an equally important role. 

The signs of the real parts of the exponents, in the basis functions, 
determine whether the functions grow indefinitely, or die out. The sta- 
bility bounds derived herein are upper and lower bounds on the real parts 
of the exponents. The specific bounds depend on the composition of 
the circuit — whether it is composed exclusively of resistors and ca- 
pacitors, resistors and inductors, or capacitors and inductors, or in- 
cludes all three kinds of components. For each composition, there are 
two pairs of bounds, corresponding respectively to the node equation 
and mesh equation (which will be defined in Section IT). 

The form of the bounds is illustrated by the following: Let G and C 
be the node matrices of the conductances and capacitances of a period- 
ically varying circuit of resistors and capacitors. Consider the zeros in 
terms of the scalar variable \ of the determinant of (G + 1C + AC is 
If the capacitances are positive at all times, the matrices can be so 
defined that C is positive definite. Then the zeros of the determinant are 
real, and they vary periodically with time. It will be shown that the 
time averages of the instantaneous maximum and minimum over the 
set of zeros are upper and lower bounds on the real parts of the exponents 
in the basis functions. 

The mesh equations lead to similar bounds in terms of the zeros of the 
determinant of (K — 4R + AR), in which K and R are the mesh mat- 
rices of the stiffnesses and resistances of the capacitors and resistors. It 
is well known, on energy grounds, that a circuit of varying positive 
resistors and fixed positive capacitors cannot be unstable. These bounds 
show that, likewise, a circuit of fixed resistors and varying capacitors 
cannot be unstable. It is true even though the varying capacitors can 
give power gain. 
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Similar buunds can be obtained for circuits of resistors and inductors 
only, by a simple transformation of current and voltage variables. 

The bounds for “RC” and “RL’’ circuits are at least reminiscent of 
known bounds on the characteristic roots of a dissymmetrical constant 
matrix. Let A be a real matrix. The largest and the smallest of the char- 
acteristic roots of }(A + A‘) are upper and lower bounds on the real 
parts of the characteristic roots of A itself. 

For a circuit of inductors and capacitors only, let S be the node 
matrix of the reciprocals of the inductances, and let C be again the node 
matrix of the capacitances. Consider the zeros of the determinants of 
the two matrices (LC + AC) and (—48 + AS). Treat the two sets of 
zeros as a single set of numbers. Then the time average of the maximum 
over the set is an upper bound on the real parts of the exponents in the 
basis functions, and the time average of the minimum over the set is a 
lower bound. The mesh analysis leads to similar bounds, except that the 
pertinent matrices are now (41 + XL) and (—3K + AK), where L and 
K are the mesh matrices of inductances and stiffnesses. 

In some ways, the bounds for circuits of inductors and capacitors are 
less satisfactory than those for circuits of resistors and capacitors. When 
the inductors and capacitors are fixed (and positive) the damping is 
necessarily zero. When they are varying the damping may or may not 
be zero. The bounds derived here generally bracket zero damping. Thus 
they do not say whether or not a time-varying circuit of inductors and 
capacitors has any basis functions with damping different from zero. 
They merely say that if the damping is different from zero it is at least 
within the bounds. 

Similar bounds are easily established for circuits of all three kinds of 
components. However, they tend to be weaker than the bounds for 
circuits of two kinds of components only, in ways which will be explained. 

Some of the derivations have not been completed to the extent of 
proving validity for all singular, as well as normal, situations. What is 
reported here is the result of exploratory studies of time-varying circuits, 
which still leave some details to be filled in. 

The author is indebted to I. W. Sandberg, whom he has consulted 
freely concerning properties of matrices in general and of positive definite 
and semidefinite matrices in particular. The circuit and power equations 
are formulated more carefully by C. A. Desoer and A. Paige.’ However, 
their analysis tends more to pure mathematics and less to the theory of 
fixed circuits. They do not include the circuit theory type of manipula- 
tions or the stability bounds, which are the primary concerns of this 


2578 THE BELL SYSTEM TECHNICAL JOURNAL, NOVEMBER 1963 


paper. H. E. Meadows has noted somewhat similar, but weaker bounds.’ 
R. A. Rohrer*® has derived independently the same stability bounds. 
However, he has done so in terms of classical dynamics (generalization 
of the equations of Hamilton and Lagrange) rather than in the circuit 
theory terms used here. 

The material in this paper is organized as follows: All aspects of circuits 
of resistors and capacitors are considered first — formulation of circuit 
equations, power relations, and stability conditions. Then the same 
analysis is shown to apply to circuits of resistors and inductors, by 
changes of variables. Thereafter, circuits including both inductors and 
capacitors are analyzed in a similar way. This appears to be less con- 
fusing than trying to develop the properties of all the kinds of circuits 
simultaneously. 


II, CIRCUITS OF RESISTORS AND CAPACITORS 


2.1 Formulation of Circuit Kquations 


Circuit equations can be formulated for linear time-varying com- 
ponents in almost exactly the same way as for fixed components. Basi- 
cally, there are two parts to the formulation. The first defines the be- 
havior of individual components; the second applies rules for combining 
the effects of the various components in a circuit. 

The components with which we are concerned are of the two-terminal 
or one-port type. lig. 1 represents a typical component, with terminals 
jand k. Voltages /; and Ef, are associated with the terminals. Current 
I,; enters the component through terminal k and leaves it through 
terminal 7. (Then Jj, = —Ih; .) 

The role of the component is to perform an operation which inter- 
relates the voltage difference 4; — H,; and the current J;;. The com- 
ponents with which we are concerned here perform only linear opera- 
tions. We shall be interested sometimes in the operation which transforms 


, 3 
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Mt / 
A / 
\ Ikj Ikj Z 
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/ / \ 
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Fig. 1 — A two-terminal component. 
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(i, — H;) into I,; and sometimes in the inverse operation. Thus we 
may write 


Ing = Anj(e — Fj) 


: (1) 
Ey — Ej = Auj ‘Th; 


in which A;; is a linear operation and A;; ’ is its inverse. 

Sometimes it is more convenient to consider the charge Q;; , which is 
of course related to the current by I,; = Q:;. We shall make extensive 
use of the symbol p to indicate differentiation: 


2 
a D) 
Then the charge-current relation becomes 
Leg = pQei (3) 
or, inversely 
On; = [ts dt. (4) 


The operations performed by linear resistors and capacitors are dis- 
played in Table I. For each of the two kinds of components, the opera- 
tions are stated in the two inversely related forms, and in terms of both 
I,j; and Q;; . Ina time-varying circuit, the resistance R;.; and conductance 
Gy; of the resistor and the capacitance C;; and stiffness Ky; of the ca- 
pacitor may be functions of time. 

When the coefficients vary with time they must be written in proper 


TABLE I — LINEAR OPERATIONS PERFORMED BY RESISTORS AND 
CAPACITORS 





Notation: Ey, — Ej 
kj 


voltage across component 
current through component 





Q:; = charge delivered to component 
Tey = pQk; 
Resistors: Gy; = conductance, R;,; = resistance 
Rij = Gay} 
Tj = Gij(Ex — Fj), Qs = S Gijy(Ex — E;)dt 
(Ex — Ej) = Rela; , (Ex — Hj) = RejypQij 
Capacitors: Cy; = capacitance, Ky; = stiffness 
Ck; = Ki} 
Qk = Cr;(Ex — Ej), Thy = pCyj(Ex. — Ej) 
Ex — Bj = KijQii , Ex — By = Kij S Lnjdt 
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relationship to the differentiation operation p. lor example 
al nl d 
pCrj( Ex — E;) = at [C(x — E;)\) 4 Cryp(E, — Ej). (5) 


Throughout the paper we shall be critically concerned with the ‘non- 
commutability of p and time-varying coefficients.” 

A circuit is formed by interconnecting a number of components, for 
example as illustrated in Fig. 2(a). The interconnections may be repre- 
sented by the corresponding linear graph, as in Fig. 2(b). The interac- 
tions between the various components are determined by Jtirchoff’s two 
laws. The voltage law says that the same voltage [;, can be assigned to 
each node (graph vertex) k, in forming the voltage differences for all 
components connected to k. Then the sum of the voltage differences 
around any mesh (graph cycle) must be zero. The current law states 
that the sum of the currents into any node must be zero. These remarks 
are exactly the same whether or not the components vary with time. 

We shall consider separately the two different forms of circuit equa- 


(a) 





(b) 


Fig. 2— A typical circuit: (a) circuit diagram; (b) linear graph. 
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tions which are commonly used — the node equations and the mesh 
equations. I‘or the node equations one node is chosen as datum, and the 
excitation (forcing function) is described as currents fed into other nodes 
as in Fig. 3(a). Then the node voltages (relative to the datum) are 


(a) 





NODE 0 = DATUM (Ing = - ZI yx, Eo = 0) 





Fig. 3 — Node and mesh currents and voltages: (a) node analysis; (b) mesh 
analysis. 
related to the currents by a vector differential equation. The procedure 
is exactly the same for time-varying circuits as for fixed circuits, pro- 
vided one is careful to preserve the correct order of the differentiation 
operator p and time-varying coefficients. 

The vector equation may be written 


Iy = (G+ pC)Ey. (6) 
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Here Ly and Jy are column matrices, or vectors, whose elements are 
the voltages and excitation currents associated with the various nodes. 
If only certain of the nodes are externally accessible, for excitation cur- 
rents, the elements of J corresponding to the other (‘internal’) nodes 
are simply constrained to be zero. 

G and C are square matrices defining the specific relation between Jy 
and Hy. Their elements are easily determined from the instantaneous 
conductances and capacitances of the resistors and capacitors of the 
circuit, and the usual relations of elementary circuit theory apply even 
though the conductances and capacitances vary with time. 

It follows that G and C, at each instant of time, have the properties 
of the matrices usually associated with fixed networks. Thus G and C 
are symmetrical, and if the conductances and capacitances of the com- 
ponents are nonnegative, G and C are positive definite or semidefinite. 
The symmetry and the nonnegative character of the matrices, together, 
lead to an important part of the specialization in circuit theory, rela- 
tive to the usual pure mathematics of differential equations. 

When the differentiation operator p appears in front of a matrix, it 
signifies differentiation of each element of the matrix. When it is fol- 
lowed by a matrix product, it signifies differentiation of each element in 
the single matrix equal to the product of matrices. Thus (6) is merely 
a compact way of writing 


d 
Iy = Gly + & Qw 


Qn = CEn ra (ais) (7) 


d d 
ae = (4 Ws); 


For the mesh equations, meshes are chosen in a somewhat arbitrary 
way, and the excitation is described as voltage generators inserted in 
the meshes as in Fig. 3(b). The meshes correspond to cycles in the linear 
graph, and the number of meshes is the maximum number of independ- 
ent cycles permitted by the topology of the graph. The mesh currents 
are related to the excitation voltages by a vector differential equation. 
As before, the procedure is the same for time-varying as for fixed cir- 
cuits, provided p and time-varying coefficients are written in the proper 
order. 

For a network of resistors and capacitors, it is more convenient to 
use charges in place of currents. Then 
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I 


Ex (Kk + Rp)Qs 
Iu = pQu . 


Here Hr, Qa, I are column matrices or vectors. Hy and I 4 are not 
the same as Hy and Jy, although they are related to them in a quite 
complicated way. 

R and K are square matrices defining the specific relation between 
Hy, and Qs. Their elements are determined from the instantaneous 
resistances and stiffnesses of the resistors and capacitors of the circuit, 
and the usual relations of elementary circuit theory again apply. R and 
K are related to G and C of (6), but in a quite complicated way. The 
elements of R and K are not simply reciprocals of elements of G and C 
nor are matrices # and K the inverses of G and C. 

Like G and C, R and K have properties of matrices usually associated 
with fixed networks. They are symmetrical, and if the resistances and 
stiffnesses of the components are nonnegative, the matrices are positive 
definite or semidefinite. 

A further point should be mentioned. If (8) is to be valid for all 
circuits, the constants of integration implicit in the Q;;, as defined in 
(4), must be consistent in the following sense: They must be such that 
the indefinite integral in (4) can be replaced by a definite integral, say 


t 
i Th; dt, 
to 


with t) the same for all kj. This is the same as requiring that all Q,; must 
be zero at some one time fo . 

When the condition is not met, the superposition theorem can be 
invoked to express the complete circuit relations in two parts. The first 
assumes that all capacitors are completely discharged at time t), and 
relates the charges accumulated at ¢ > t to the generator (mesh) voltages 
at t > t). The second starts with the actual charges at ¢ = to and deter- 
mines their later values in the absence of generator voltages. When the 
capacitances vary with time, the initial charges may have a much more 
important effect than is possible in a fixed circuit. This will be discussed 
in physical terms in Section 2.4.1. 


(8) 


2.2 An Algebra of p and. Matrix Coefficients 


This section introduces manipulations of a sort which we shall use 
extensively in this and later papers. In the manipulations it is convenient 
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to use both p and a dot over a symbol to indicate differentiation with 
respect to time: 


: d 
Generally (but not quite always) the dot will be used for rates of change 
of coefficients and p for differentiation of primary variables (such as 
voltages or currents) or products of coefficients and primary variables. 
By way of introduction, consider the following scalar expression: 


pax = £ (ax) = dx + apx = (4 + ap)z. (10) 


Suppose x is a principal variable and a is a time-varying coefficient. 
Then a may be regarded as a linear operator which multiplies x by a 
function of time. In the same way, 7: is an operator which differentiates 
x, pa is an operator which multiplies x by a and differentiates the prod- 
uct, and ap is an operator which differentiates « and multiplies the 
derivative by a. 

Equation (10) may be said to state a commutation rule, which may 
may also be stated as an “operator identity” (both sides of which are 
operators): 


pa = d+ ap. (11) 


Thus p and a commute without change if and only if a is constant, so 
that d = 0. 

The concept is easily extended to more complicated combinations, for 
example 


(a+p)\(b+p)=ab+b+(atdb)p+p. (12) 


An algebra of this sort, in p and scalar coefficients, is useful for the 
manipulation of scalar differential equations. It is a principal tool in, 
for example, Ref. 4. 

For present purposes we need to extend the concept to an algebra of 
p and matrix coefficients, as a tool for manipulating vector differential 
equations. Suppose X is a matrix variable and A is a matrix coefficient. 
As a first example, it is easily established that 


pAX = AX + ApX = (A+ Ap)X. (13) 


The relation follows at once from the fact that each element of AX is a 
sum of terms, each of which is a product of one element from A and one 
from X. Equation (11) can be applied term by term and the results can 
then be sorted into contributions to AX and ApX. 
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Corresponding to (13) is the “operator identity” 
pA = A+ Ap. (14) 


The concept is quickly extended to more complicated operations. Some 
“operators equations” are collected in Table II, together with some 
familiar algebraic matrix identities. In the table 


A‘ = transpose of A 
A’ = inverse of A (15) 
U = unit or identity matrix. 


Constant scalar and matrix coefficients commute with p without 
change. Time-varying coefficients do not. (But of course matrix factors, 
constant or not, generally do not commute with each other.) This may 
be regarded as the most important distinction between the theories of 
linear time-varying and fixed circuits. If it were not for the difference 
in the commutation rules, most of the familiar techniques applied to 
fixed circuits would apply directly to time-varying circuits. As it is in 
fact, the more complicated commutation rules lead to numerous com- 
plications, as we shall see. 


2.2.1 Matrices of Order One 


Much of this paper is concerned with scalar quantities (for example, 
net input power) derived from vector circuit equations. or some pur- 


TaBLE II —Some Matrix RELATIONS 








A, B, X, Y = matrices, gy = a scalar 


In relations which involve inverses, pertinent matrices are assumed to be square 
and nonsingular. 








SOME ALGEBRAIC IDENTITIES 
X(A + B)Y = XAY + XBY 


eX ¢ ~ 
(XY)! = ¥¢Xs, (X¥y1 = yuxn 








SOME OPERATOR IDENTITIES 
pA = A+ Ap, pA = A+ 2Ap + Ap? 


pAp = Ap+ Ap? = —pA + pA 
pAp = ~¢A+ppA, pAer = er(¢A + pA) 








DERIVATIVES OF SOME MATRIX FUNCTIONS 
pX? = XX + XX #2XX | 
pXYZ = XYZ+ XYZ4 XYZ 
Xp(X3) = -—X¥X1, pX = —X1XX1 x -(X-2X 
pX' = (pX)! 
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poses, scalars may be represented by matrices of order one, and the 
scalars in question will be derived in that form. Certain operations will 
be used repeatedly in this connection. 

Suppose W and V are n X 1 column matrices and Y is ann X n 
square matrix. Then 


YV = acolumn matrix 
; (16) 
W'YV = a matrix of order one. 
A matrix of order one is always symmetrical, for there are no off-diago- 
nal terms to interchange in forming the transpose. Thus, using a trans- 
pose rule from Table IT, 


WYV = (W'YV)' = V'Y'W. (17) 
Certain special cases are particularly }mportant. 
If Y' = Yand VW, V are column matrices, 
WYV = V'YW. (18) 
The differentiation rules in Table IT require 
WV'YV) = VIYV + V'YV + V'YV.« (19) 
Applying (18) gives 
if Y' = Y and V isa column matrix 
WYV = V'YV = i p(V'YV) — 3 V'YV. (20) 


Finally, suppose the transpose of Y is the negative of Y. Then (17) 
requires 


if Y’ = —Y and V isa column matrix 
VY¥V=aVY'V = —V'YV =0. (21) 


(If a quantity is equal to its negative it must be zero.) 
2.3 Power and Stability in Terms of the Node Equation 


2.3.1 Instantaneous and Average Powers 
We return now to the circuit equation of the nodal analysis: 
Iy = (G+ pC)Ey. (22) 


The power P supplied to the circuit by the excitation currents is 


LINEAR TIME-VARYING CIRCUITS 2587 
P = >> EvIvp = Ev'Ty. (23) 
k 


Multiplying the circuit equation by L’, 
P= Ey (G+ pC) Ey . (24) 
expanding in terms of identities in Table II gives 


P = Ky'GEy + Ey'CEy + Ey'CEy . (25) 


Applying (20) to the last term leaves 
P = BEy'(G + 3C)Ey + 3p(Ey'CEy). (26) 


The power relation confirms, in circuit equation terms, what must be 
expected on physical arguments. Thus 4p(Hy'CEy) is the rate at which 
energy is being stored electrically in the capacitors. Then Ey'GEy is the 
rate at which energy is being dissipated in the resistors, and 4#y'CEy 
is the rate at which energy is being removed from the circuit by what- 
ever means are used to vary the capacitances. (Recall that increasing a 
capacitance decreases the stored energy per unit charge. ) 

The average power P is frequently of interest as well as the instan- 
taneous power P. For the average over a finile interval, say t, to te, 
integration of (26) gives 


a t P ty 
Ave P = Ave [Ey'(G + 3C) Ey] + ily Cy) te C27) 


to to ty ty to ty 2(te — ty) 
When [y'CEy is bounded at all times, the last term approaches zero 
as (tg — t) approaches infinity. Thus, for long time averages, 
if Ey'CEy is bounded at all times 


P = Ave [Ey'(G + 3C)Ey]. (28) 


2.3.2 Linear Transformations on Hy and Iy 
The power equation suggests rearranging the current equation so as 


to emphasize the matrix (@ + 1C). Operator identities in Table I 
yield 

Iv = [(G +4C) + 4(pC + Cp) Ew. (29) 

It is now time to introduce a linear transformation of a sort which 


will be used extensively in this and later papers. In particular, let 


Ey = NE 


(30) 
I = N'ly. 
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Linear transformations of this sort are of course standard means for 
diagonalizing matrices. They are also well known as means for generat- 
ing equivalent circuits, when components are fixed.” Their appropriate- 
ness for time-varying circuits is by no means obvious, and they do in 
fact lead to serious complications not encountered in connection with 
fixed circuits. As usual, the complications stem from the commutation 
rules. 
Multiplying (29) by N‘ and using (30) gives 


fT = [N'(G + 3C)N + 3(pN'CN + N'CNp) 


este ie a (81) 
+43(NCN —NCN)IE. 
Note that 
(N‘'CN — N'CN)' = —(N'CN — N'CN). (32) 
When C is positive definite, there is a transformation N such that: 
N'CN= U 
(33) 


N'(G + 4C)N = —D. 


Here U is the unit matrix of suitable order and D is a diagonal matrix. 
Defining D with negative sign simplifies the later discussion. The exist- 
ence of a suitable transformation matrix, at each instant, follows from 
elementary circuit and matrix theory (for positive components). When 
C is only positive semidefinite, (6) can be transformed into a new equa- 
tion, in fewer dimensions, with a positive definite C. One such procedure 
is outlined in the Appendix. 

With the negative sign in the second equation of (33), the (diagonal) 
elements d;, of D are identical with the zeros \; of the determinant of 
matrix (G + 4C + AC). 


dik = Vx 


(34) 
det (G + 30 + aC) = 0. 


In our applications, because G and C are functions of time, the A; are 
functions of time. If the circuit components are always nonnegative, 
the d; are all real. 

When the transformation N is fixed by (33), (81) and (32) become 


f=(-D+pU+4+ J)E 
J = i3(N'N — (NN) (35) 
Dee 
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The power equation (26) becomes 
P= = BN'T = BT = —E'DE + 3p (#'B). (36) 


(The product £'JE = 0 because J‘ = — J and # isa column matrix.) The 
power equation could have been transformed directly, but we shall find 
it informative to have also the transformed current equation (35). 


2.3.3 Some Bounds on the Basis Functions 


Carrying the analysis only a little further establishes some interesting 
bounds on the basis functions of circuits which vary periodically. This 
subsection outlines a derivation, but simplifies the argument by means 
of some somewhat restrictive assumptions. The next subsection reviews 
the derivation and removes most of the restrictions. 

The basis functions are counterparts, for time-varying circuits, of the 
familiar natural modes of fixed circuits. In node terms, they are a set of 
linearly independent solutions of (22) for Hy with Jy = 0. Thus, if /, is 
a vector basis function, 


0= (G+ pC)E,. (37) 
When Jy = 0, P = 0 and (26) becomes 
0 = £,(G + 4C)E, + 4p(E,'CE,). (38) 


Also, when Iy = 0, f = 0 and the transformed equations (35) and 


(36) become 


0=(—D+ Ups J)E, 
0 = B,'DE, + 4p(B,'E.) (39) 
E, = NE,. 


If the circuit has n degrees of freedom there are n basis functions in 
the set. They may be chosen in many different ways, but each choice is 
a linear transformation on every other choice. 

If the circuit varies periodically, G and C in (37) and (38) vary peri- 
odically [and also D and J in (89)]. It is well known that the basis 
functions of a linear differential equation with periodically varying co- 
efficients can usually be so chosen that they are exponentials with 
periodic coefficients.* Any exceptions are singular cases which we can 
best take care of in the next subsection. 


* See the discussion of the Floquet-Poincaré theorem in a text on differential 
equations, such as pages 78-81 in Coddington and Levinson.°® 
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Thus for a periodically varying network, we can use 
i, = H, exp (st). (40) 


Here s, is a constant, the exponential is a scalar factor, and H, is a 
periodically varying vector. l‘or the purposes of this subsection, we can 
best restrict ourselves to circuits for which s, and H, are real. The re- 
striction will be removed in the next subsection. 

We shall find it convenient to replace H, by another periodic vector 
F, , related to H, by 


H, = Fie (41) 


in which @ is an arbitrary periodic real function of t (with the same 
period as the time-varying circuit components). In terms of F, , 


E, = Fe exp (sot + 6). (42) 


Because the exponential is simply a scalar factor, (42) and the last 
equation of (39) require 


l| 


BL, = F, exp (st + 0) 
: (43) 
oe = NF, . 
For any matrix A, 
pAe*'™ = fexp (set + O)][(s. + 6)A + pA]. (44) 
Using this relation in (39) and then cancelling out the exponential 
factor gives 


C21e +0 = De JR of, 
0 = F,[(s + 6)U — Dif. + 4p(F'P,). 
Now &, does not remain bounded over an infinite time interval. Hence 


averaging the second equation of (39) does not eliminate the second 
term. On the other hand, /*, is bounded at all times, and thus the second 


equation of (45) implies: 
0 = Ave F,'[(s, + 6)U — DIF, . (46) 


Since /’, , D and 6 are all periodic, while s, is constant, the long time aver- 


age is the same as the average over any one period. 
The matrix (sc + 9)U — D is diagonal. That is, all its elements are 
zero except on the main diagonal, where the typical element is 


Guz = So + 6 — rx 
Det (G + 40 + 2C) = 0. 


(45) 


(47) 
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Then multiplying out the matrix product in (46) gives 


0 = Ave d (s, + 6— Ax) he, (48) 
in which F,, is the kth element in the column matrix /, . 

When all the quantities are real (as assumed), condition (48) cannot 
be true if all the coefficients (s; + 6 — dz) are positive at all times, or 
if all are negative at all times. We can use this circumstance to set bounds 
on the exponent s, of the basis function. The arbitrary function @ has 
been introduced as a means of strengthening these bounds, as explained 
below. 

Let us assume temporarily that the A; are at all times distinct. Fig. 
4(a) illustrates a plot of the \; over one period of their periodic varia- 
tions. Let Ax, be the largest d;, , and let 61, be a choice of @ such that 


Aa — Ox = constant. (49) 


Tig. 4(b) illustrates a plot of the corresponding dX; — 61. Every coef- 
ficient (s, + 6 — dx) in (48) will now be positive unless s, is no greater 
than » Mo 6 ar . 

Now 6 is actually determined uniquely by (49). Because @ar is 
periodic, the average of 6, over any period is zero, and hence also the 
long-time average. Then, when Aar — Ox is constant 


Ave As — 6 ar) hap = Oy = AVE Mar 


(50) 
Oar = Am = Ave Am: 
Thus all terms in (48) will have positive coefficients (so + O11 — Ax) 
unless 
S¢ < Ave Aw a (51) 


Exactly the same sort of procedure leads also to a lower bound on s, . 
Let Am be the smallest d;, and 6, a choice of 6 which makes Am — Om a 
constant. Fig. 4(c) illustrates the corresponding \, — 8». The bound on 
Ss, can be written 


S = Ave Xm. (52) 


2.3.4 Recapitulation, Discussion, and Removal of Restrictions 


This subsection reviews the derivation of the stability bounds, and 
removes most of the restrictions. 

Our diagonalization of the matrices G + 3C and C assumed that C is 
positive definite. The Appendix describes how an equation with a posi- 
tive semidefinite C can be transformed into one of lower dimensionality 
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(a) 
Am 


Ak 


Am 


Am ~(AM-AVG Ay) = AVG Any 


7 Ak-(Ay-AVG Ay) 


A-(Ay - AVG Am) 





A-(Am-AVG Am) 


~Ak —(Am -AVG Am) 





TIME ——> 


Fig. 4 — Characteristic roots: (a) A; (b) A — 9, with @to make maximum = con- 
stant; (c) \ — 6, withé@ to make minimum = constant. 
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with a positive definite C. More exactly, the transformation is such that 
the components of the transformed current and voltage vectors may be 
divided into two autonomous parts. If n and m are the order and rank 
of the original matrix C, m components of the transformed current and 
voltage vectors are related by a differential equation of our standard 
form, in m dimensions, with a positive definite C matrix, and m linearly 
independent basis function subject to our stability bounds. The remain- 
ing nm — m components of the current and voltage vectors are related by a 
purely algebraic equation, which raises no questions of stability. 

In our derivation of the bounds on s,, we restricted ourselves to 
periodically varying circuits such that all the basis functions (corre- 
sponding to a particular choice) are exponentials with periodic coeffi- 
cients. In singular instances such a choice is not possible. However, for 
periodically varying circuits, the basis functions can all be exponentials 
with coefficients which are at most polynomials in ¢ with periodic coef- 
ficients. Furthermore, the coefficients are polynomials in ¢ only when 
more than one basis function has the same coefficient s; in the exponential 
factor exp (st). Out of the set of basis functions corresponding to a 
single s, , one can always be assigned a periodic coefficient H;, . This is 
sufficient to establish our bound on s, , without regard for the possible 
existence of other basis function with the same sg; . 

We also assumed that the constant s, and the periodic function H;, 
are real. The assumption is not actually necessary. With real circuit 
components, complex basis functions can be chosen in conjugate pairs. 
Then equally valid choices are the real and imaginary parts of the com- 
plex functions. Thus we can write, for a complex exponential basis 
function, 


(X, + iY, eet?! = Z,, exp (st) + iZo, exp (Sob) 
Zig = Xz cos wt — Y, sin art (53) 
Zog = Xo SIN wet + Vo COs wel. 


We can now use either Z;, exp (St) or Zoe exp (Sf) as a basis function, 
in place of H, exp (st). The only difference is that the Z,’s are not pe- 
riodic, as is H, . However, the Z,’s are bounded at all times, and so are 
F, and F, , which are now defined by 


Zio OY Zag Fe 
F, = NF,. 


(54) 
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Furthermore, the boundedness of Ff, is all that is required, for passing 
from (45) to (46), provided the average in (46) is a long-time average. 
Thus the real part s,, of a complex coefficient s, + 7, , also obeys our 
bounds. 

An alternate proof uses the complex basis function itself, replaces Jf,’ 
by its conjugate in (38) and (39) and sorts out real and imaginary parts 
of the equations. 

The exponents corresponding to circuits of constant resistors and 
capacitors are necessarily real. However, when both the resistors and 
capacitors vary with time, the exponents may be complex. A number of 
specific examples are known, including examples cited by Desoer and 
Paige’ and by Meadows.” 

Finally, we assumed that the zeros \; of the determinant of (G + 
1C¢ + )C) were distinct at all times. This is not necessary, provided we 
make a simple change in the statement of our bounds on s, . Suppose 
the various ),’s crisscross, as in Fig. 5 (which may be contrasted with 
Fig. 4a). The variables Aa, and \,, are now defined as the instantaneous 
maxima and minima over the set of variables \;, -:- , An. Otherwise 
the bounds are the same as before. 

Thus, for general circuits of periodically varying positive resistors and 
capacitors, if s, is the real part of the coefficient in the exponential factor 
associated with a basis function, 





TIME ——> 


Fig. 5 — Ay and An when ,;’s crisscross. 
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Avé Am S & S&S Aved\x (55) 
Am, Am = instantaneous min. and max. 
over the set \1, °°" , An- 
Det [G4 + 30 + »,C] = 0. 


When the circuit components are constants, the d; are constant, and also 
each s, is exactly equal to one of the \;. Why do the averages of the i, 
only furnish bounds when the components vary with time? 

The answer lies in the joint implications of the vector and scalar 
equations 


0 = [(s. + dU —D+ J\F, + vk, 
Ave {F,'[(s, + 6)U — D]F,} 

J = NN — (N°N)' 

Ji= —J 


[equations (45) and (46) of Section 2.3.3]. It will be sufficient to con- 
sider only the second-order case, for which (56) represents the following 
collection of equations 


(56) 


Q = Sis 6 yam MPa + Pa + Sil ss 
0=(s + 6- de) Pye + ie — Julia (57) 
0 = Ave [(s, + 6 — A)Ra + (se + 6 — do) Po’. 


(Jy = Je = Oand Jy = —Jy. because J’ = —J.) 
With constant coefficients, J, = 0, /, is constant, and one suitable 
solution is 
(ss +@—%1) =0, to satisfy the Ist eq. 
Fi. = 0, to satisfy the 2nd eq. (58) 
(s, + 6 — 1) and Fy = 0, to satisfy the 3rd eq. 


With time-varying components, making (s, + 6 — 1) zero in (57) 
leaves 


0 = ae ae Jil se 
(a — 2) Poe + Pee — Juke (59) 
Ave [Oy a Noho: 
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These equations are (at least usually) incompatible when Ji, + 0. Thus 
the nondiagonal matrix J, which appears only when components are 
time-varying, is what destroys the identity of the exponents s, and the 
determinant zeros \; (as here defined). Recall that J stems from the 
commutation rules applied to combinations of 2’, p, C, and LE. 

When the resistors are time-varying but the capacitors are fixed, C = 
0 and (G + 41€ + AC) becomes (G + AC), which has been studied 
extensively in connection with fixed networks. When C is positive 
definite and G is positive definite or semidefinite, none of the zeros Ax 
can be positive, and the same is true of their averages when they are time- 
varying. Then our bounds exclude any positive s, in the exponential 
factors exp (st) associated with our basis functions. But an unstable 
basis function (defined as one which grows indefinitely ) requires a posi- 
tive s,. Thus our bounds confirm the stability of circuits in which only 
the resistors vary. 


2.3.5 Comparison with a Known Property of Constant Matrices 


The bounds on s, are at least reminiscent of known bounds on the 
characteristic roots of a dissymmetrical constant matrix. Consider the 
roots Ss, of 


Det (A + s,U) = 0 (60) 


in which A is a dissymmetrical matrix. For the closest parallel to our 
analysis assume that the characteristic roots are all real. The equation 
may be rewritten as follows: 


Det (S + J+ 8,0) = 0 
S=1(A + A’), S= 8 (61) 
J =1(A — A’), Jé' = —J. 


I 


I 
I 


When the determinant is zero, there are nonzero vectors X such that: 


(S+ J+ sU)X = 0. (62) 
Because J’ = —QJ, this implies the scalar equation 
X'(S + s,U)X = 0. (63) 
Diagonalizing S leaves 
> (85 = eee = 0 
: (64) 


The J term in (62) excludes (in general) an X in which all elements 


LINEAR TIME-VARYING CIRCUITS 2597 


are zero except one. Then s, ¥ d;, but the largest and smallest ; are 
upper and lower bounds on s, . Simple changes in the analysis establish 
the same bounds for the real parts of complex characteristic roots. 


2.4 Stability in Terms of the Mesh Equation 


Section 2.3 dealt exclusively with the node equation. However, the 
mesh equation can be manipulated in almost exactly the same way. 
Recall the mesh equation [(8) of Section 2.1], 


Ey = (K+ Rp)Qx. (65) 


Note the order of operations Rp here, and contrast it with pC in the 
node equation. 
Define a scalar 1 by 


M= 2 Q wcll ux = Qu' Ex . (66) 


While W does not have the dimensions of power, it has many of the 
mathematical properties of the power function P which we associated 
with the node equations. Multiplying Ex by Qu’ and applying operator 
identities gives 


M —= Qu (K = 1R)Qu + 1p(Qu RQ xa). ‘ (67) 


Compare this with the power equation (26) in Section 2.3. The appear- 
ance of —3f in M, as opposed to +4C in P, reflects the difference of the 
order of operations in Rp and pC. The quantity 3(Qa'RQm) does not 
represent stored energy, but it is mathematically similar to the stored 
energy function 1£y'CE, . Both these quantities may be regarded as 
Lyaponov functions. 

Proceeding from here exactly as in the analysis of the node equations 
leads eventually to similar, but not identical, bounds. The basis func- 
tions are now mesh charges Q, instead of node voltages /, . When circuits 
vary periodically they can be chosen as exponentials with periodic 
coefficients: 


Q, = H, exp (sot). (68) 
The bounds on the s, (or the real parts when complex) may be written 
Ave \n S 8s S Aved\xu 
Am, Aa = Instantaneous min. and max. 
(69) 


over the set A1, °°: , An 


Det [K — 42 + 4R] = 0. 
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Now the coefficients s, must be identical in the exponentials associated 
with the node and mesh analyses. This is because the ratios of charges 
and voltages remain bounded. Thus the two sets of bounds apply to the 
same set of s,. Then one can form a single pair of bounds by choosing 
the lesser of the two upper bounds and the greater of the two lower 
bounds. 

When the capacitors are time varying but the resistors are fixed, R = 0 
and (K — 4R -+ dR) becomes the familiar (K + dR) of the fheoty of 
fixed circuits. Then the bounds require that such a circuit cannot be 
unstable (with positive circuit components) even though the time-vary- 
ing capacitors give power gain. 


2.4.1 A Complication in Degenerate Special Cases 


In the discussion of the mesh formulation we have ignored a degener- 
ate special situation, which complicates a more nearly general analysis. 
The complications arise when there is a node within the network to which 
two or more capacitors are connected, but no resistors. 

Consider first the simplest example, as illustrated in Jig. 6(a). Here, 
two capacitors are connected in series between nodes 7 and k, and no 
other components are connected to their common node c. If the capaci- 
tances are constant, onc can simply replace the two capacitors by a single 
equivalent. When they are time varying, the substitution may have to 
be more complicated. 

The reason is briefly as follows: Suppose there is a positive charge on 


(b) 


Fig. 6 — Illustration of a degenerate special case: (a) time-varying capacitors 
ae a common node to which no resistors are connected; (b) the Thevenin equiva- 
ent 
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one capacitor and a negative charge on the other. Then there will be a 
ratio of charges (at any one time) which will produce a zero voltage dif- 
ference between terminals 7 and kk. If the ratio of capacitances varies 
with time, the required ratio of charges varies with time. Conversely, the 
only combination of constant charges which yields zero voltage difference 
(i, — E;) at all times is zero charge on each capacitor. Furthermore, if 
there is initially a positive charge on one and a negative charge on the 
other, both charges cannot be reduced to zero through nodes 7 and &k 
alone. On the other hand, when the capacitances vary periodically there 
is a ratio of constant charges such that the voltage difference varies 
periodically and with zero average. 

Tor the mesh analysis one can invoke Thevenin’s theorem, and replace 
the two capacitors by an equivalent capacitor and a periodic, zero aver- 
age voltage source, as illustrated in Fig. 6(b). When there are several 
capacitors and no resistors to a single node, or a combination of resistor- 
less nodes, several Thevenin voltage generators are called for. A general 
characteristic of circuits which can lead to this sort of complication is a 
resistance matrix 2 which is singular (positive semidefinite instead of 
positive definite). 

It follows that the initial charges may be much more important in 
time-varying than in fixed circuits. or example, suppose there is only 
one resistorless node and only two capacitors connected thereto. In a 
fixed circuit, the effects of initial charges will eventually die out except 
for the voltage at the single resistorless node. In a time-varying circuit, 
however, the varying Thevenin voltage may produce voltages at all 
nodes forever. 

l’or the node analysis, one can ignore these complications by simply 
not eliminating the resistorless nodes. When there are resistorless nodes, 
the conductance matrix G is positive semidefinite. When G@ has order n 
and rank m (and C is positive definite), n — m of the basis functions have 
zero damping. This can be proved by applying to G, with only minor 
alterations, the analysis applied to a semidefinite C as outlined in the 
Appendix. The Thevenin voltages of the mesh analysis set up undamped 
oscillations within the circuit (s, = 0) and correspond to the undamped 
modes of the node analysis. The validity of the derivation of our stability 
bounds, which does not preclude a singular G matrix, implies that the 
bounds will include zero when G is positive semidefinite. 

One way to avoid these complications is to avoid ideal capacitors. If 
a resistor is connected across each capacitor, to represent the leakage 
through any actual component, there are no resistorless nodes. 
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III. CIRCUITS OF RESISTORS AND INDUCTORS 


The theory of circuits of resistors and capacitors can be applied to 
circuits of resistors and inductors by interchanging currents and voltages 
and also the mesh and node formulations. Since currents and voltages 
are interchanged, one needs variables which are related to voltages in 
the same way that charges are related to currents. Corresponding to node 
voltage Ff; , let 


E, = & (70) 


or, inversely 
o, = [a dl. (71) 


If an inductor is connected between nodes 7 and k, EH; — FE; is the 
voltage across the inductor, and ® — ©; is proportional to the flux 
linkages within it. The fact that the definition of &, in (70) or (71) leaves 
undetermined a constant of integration reflects the fact that constant 
flux linkages produce no voltage across an inductor. 

Table III states the linear operations performed by inductors, in two 
inversely related forms, and in terms of both H, — EH; and & — ;. 
The inductanee Z;,; and its reciprocal S,; can vary with time, provided 
the order of the differentiation symbol p and the coefficients is preserved. 

lor resistors and inductors, the mesh equation corresponds to the 
node equation (6) of our previous circuits, and is 


i: = (R + pL)I x é (72) 


The mesh voltage and current vectors ’y, and J y are defined as before, 
and also the matrix F of the mesh resistances. Then L is the mesh matrix 
of the inductances of the inductors. If one properly chooses constants of 
integration stemming from (71), the node equation is 


Tasue II] — Linear OPERATIONS PERFORMED BY INDUCTORS 


I,; = current through component 
E, — EH; = voltage across component 
d, — &; = variable related to flux 
Ei, = pe 
Ly; = inductance, S,; = reciprocal inductance 
Sx; = i 
b, — &; = Lele; , Ey — By = pLajle; 
kj = Sxj(@z — 5), Tei = Sry S (Ex — E;)dt 
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Ty = CS + Gp)®y 
Ey = pby 


and corresponds to the old mesh equation (8). Here vector Jy represents 
the node currents as before, and G is again the matrix of node conduct- 
ances. The elements of vector @y are the &, and S is the node matrix of 
the reciprocals of the inductances. 

The formulation of (73) requires an assumption regarding constants 
of integration, implicit in the definition of the &, , exactly like the as- 
sumptions regarding the Q;; in the formulation of (8). 

One can now proceed exactly as before to obtain power equations 
and bounds on the exponential factors associated with the basis func- 
tions of periodically varying circuits. 

Section 2.4.1 described degeneracies which occur in the mesh analysis 
of circuits of resistors and capacitors which have resistorless nodes. The 
counterparts for circuits of resistors and inductors occur in the node 
analysis of circuits with resistorless meshes. 


(73) 


IV. CIRCUITS WHICH CONTAIN BOTH CAPACITORS AND INDUCTORS 


4.1 Circuits of Capacitors and Inductors Only 


For circuits which contain both capacitors and inductors, but no re- 
sistors, the node equation contains the capacitor and inductor terms 
from our previous equations (6) and (73). Thus 


Iy = S@y + pCi y 
Ey = p®y 
or, replacing Ly by p®y , 
Iy = (S + pCp) by. (75) 


(74) 


As before, S is the node matrix of the reciprocals of the inductances, and 
C is the node matrix of the capacitances. 

To obtain the input power P from the excitation, multiply by Ey’, 
which is the same as @y’ 


P = by'(S + pCp)oy . (76) 


Manipulating in terms of operator identities, again identifying éy with 
Hy , and using the symmetry of matrices S and C gives 
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P = —1by'Sdy + 1Ey'CHy + 1p(@y'Sby) + 3p(Ey'CEy). (77) 


The first two terms are the rates at which energy is removed from the 
system by the means used to vary the inductances and capacitances. 
The last two terms are the rates of increase of the electrical energy stored 
in the inductors and capacitors. 

The basis functions ®, are solutions of 


0 = (S + pCp)4, . (78) 
The corresponding power equation is 
0 = —16,'Sb, + 4E,'CE, + 4p(®'S®,) + 3p(i.'CE.) 
Le = p®, . 


(79) 


If the circuit components vary periodically, we can define a bounded 
function I’, by 


&, = F, exp (sot + 6). (80) 
There will be a corresponding function F,’ defined by 

LE, = Fy exp (sot + 0). (81) 
Because I, = p®, , I,’ is related to F, by 

F/ = (s+ 6)F, + Fy. 7 (82) 


Under reasonable circuit conditions (which exclude, for example, dis- 
continuous changes in inductances), F’,’ is bounded, as well as F, . Using 
(80) and (81) in (79), eliminating the exponentials and averaging gives 


0 = Ave F,'[(s, + 6)S — 4S]F, + Ave F,’t[(s, + 6)C + 4C]F’. (83) 


If both S and C are positive definite, all matrices can be diagonalized. 
This merely requires a different transformation on /’, and I’,’. Thus let 


F,=N,, F! = N.'P, (84) 
and choose N, and N,’ in such a way that 
N.'SN, = U, N."CN.' = U 


te a (85) 
N.'SN, = 2D.,  N,"CN,! = —2D/. 


Here U is again the unit matrix and D, and D,’ are diagonal matrices. 
After transformation, (83) can be written 


0 = Ave DS { (so + 6 = Mie + (So + 6 _ he) Fe”. (86) 
k 
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Here, F*,, and Ff,’ are elements of the column matrices /’, and P,’, and 
Ax and d;’ are the corresponding (like-rowed) diagonal elements of D, 
and D,'. As before, all quantities can be made real (provided Fy, , Fox’ 
are only required to be bounded, not necessarily periodic). Then s, can- 
not be so large that all terms in the sum are positive, or so small that 
all terms in the sum are negative. 

If one makes no use of the implicit relation between /, and /,’, one 
can consider the two sets of constants \, and A,’ as simply two parts of a 
single set. Then bounds on s, can be obtained exactly as before. Since 
the \; depend only on C and the 2,’ only on S, while C and S represent 
different circuit components, the two sets of bounds are likely to criss- 
cross as in Fig. 7. Thus, including also the relation of \, and d;’ to S and 
C [defined by (85)], the bounds may be written 


Ave (min. over \) S s, S Ave (max. over )) 


set \ = A,’s and ),’’s 
(87) 
Det (—3S + rS) = 0 
Det (26 + 4’C) = 0. 


The mesh analysis differs only in the specific quantities involved. The 
mesh circuit equation is 


(K + pLp)Qu 
Iu = pQ a . 


As before, K is the mesh matrix of stiffnesses and L is the mesh matrix 
of inductances. 
The input power from the excitation is 


= —1Qy'KQu + Tae Dlg a 4p(Qu KQ a) + Ly(Iuy'LI x). (89) 


The bounds on sz may be written 


Lin 


(88) 


Ave (min. over \) S s, S Ave (max. over )) 


set X = X;’s and X;/’s 


(90) 
Det (—31K + 4K) = 0 


I 


Det (40 + L)= 0. 


4.2 Circuits of all Three Kinds of Components 


lor cireuits which contain capacitors, inductors and resistors, the 
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Ak OR Ak 


TIME ——> 


Fig. 7 — Characteristic roots of (— 3 8 + 8) and ( C+ C). 


node equation may be written 
Iy = (S + Gp + pCp) bn 
Ly = pty. 
The corresponding power equation may be written 
P = —i6y'Sby + iE y'GEy + 4B y'CEy 
+ 3p(Sy'Séy) + 4p(Ev'CEy) 





(91) 


(92) 
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in which the new term, 3/y'GEy , represents the power dissipated in the 
resistors. 
The stability bounds may be written 


Ave (min. over \) S s, S Ave (max. over )) 
set \ = ),’s and Ax”’s 
Det (—3S + 8) = 0 
Det (G + 406 + 2/C) = 0. 


Note that );’ is the same as },’ for a circuit of capacitors and resistors 
only [see (55)]. On the other hand, the present \; do not depend at all 
on the resistors. 

The corresponding mesh equation is 


Ey = (K + Rp + pLp)Qu 
Iy = pQ a . 


(93) 


(94) 


Then the power equation may be written 
P= —4Q.'KQu + LT REG + 1G reg ee 
i 4p(Qu' KQa) as dn(I'LI a). 


Finally, the stability conditions may be written 


(95) 


Ave (min. over \) S s, S Ave (max. over )) 


dx’S and A;’’s 


0 


set Ar 


(96) 
Dept =k ek 


I 


Det (R+3L+’L) = 0. 


4.3 Discussion 


Let us return temporarily to circuits of capacitors and inductors only. 
Suppose all the components are constant (and positive). Then it is well 
known that the basis functions (natural modes) are undamped. Since 
Sand C = 0, etc., when the components are constant, all \, and d,’ in 
(88) or (90) are zero. Then the upper and lower bounds must be zero. 
Thus, in the special case of fixed components, our bounds collapse onto 
the actual s, . 

Suppose the inductors are fixed but the capacitors are time-varying. 
Then, for example, in the bounds (88), 4; = 0 but generally A,’ ¥ 0. 
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The zero coefficients \; will affect the maximum or minimum of the com- 
bined set \ at any times when all \;’ have the same sign. 

When components vary periodically, C and S, etc., vary periodically 
and C and S, etc., have zero averages. Then any nonzero elements in 
C and S, etc., must be positive some of the time and negative some of 
the time. This suggests (but does not prove) that the bounds on s, in- 
clude zero. It is confirmed by some quite different analysis, which is 
outlined in a short paper in this issue of the B.S.T.J.7 The paper points 
out that the s,’s of nondissipative circuits occur in pairs of the form 
+s,, —s,. Thus when any s, has a nonzero real part, another s, will 
have the negative of that real part, and any true bounds are consistent 
therewith. It also follows that the bounds described above can be 
tightened (for nondissipative networks) by using the bound closer to 
zero and its negative. 

The basis functions of a circuit of capacitors and inductors may all 
have zero damping even though the components are periodically time- 
varying. This remark is supported by well known properties of the 
Mathieu-Hill equation, which corresponds to the one-dimensional special 
case of our vector equations. A weakness of our bounds is that they do 
not give meaningful sufficient conditions for basis functions with (posi- 
tive or negative) damping different from zero. The bounds will (at least 
almost) always be different from zero, but by bracketing zero they will 
not exclude it. 

Suppose now the circuit includes resistors as well as inductors and 
capacitors. Consider first the bounds (93) derived from the node equa- 
tion. If the node conductances (represented by matrix G) are sufficiently 
large, every characteristic root \;’ will be more negative than every 
characteristic root \; (which does not involve G). Then the lower (more 
negative) bound on s, will be set by the d;’ and the upper bound by the 
dx. If the conductances are further increased, the lower bound will be- 
come more negative, but the upper bound will remain unchanged. On the 
other hand, increasing the conductances will usually increase the damp- 
ing of all the basis functions. 

The same remarks apply to the bounds derived from the mesh equa- 
tion except that mesh resistances replace node conductances. Thus when 
resistors add substantial damping in circuits containing both capacitors 
and inductors, our wpper (more positive) bounds may be too weak to 
have much significance. 

Our node analysis assumes that both S and C are positive definite, 
although the initial formulation may make one or both positive semi- 
definite. As before, however, the procedure can easily be modified for 
positive semidefinite matrices. Recall that two different transformations 
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are used to diagonalize S and C, in the derivation of our bounds. When 
both are positive semidefinite, they can be separately transformed to 
positive definite submatrices bordered by zeros. Because constant transfor- 
mations can be used (in accordance with the Appendix), the transformed 
S and C retain the derivative relationship to the transformed S and C. 
Then the number of characteristic roots \; , Ax’ depends on the ranks of 
S and C. When only capacitors are connected to some nodes, the mesh 
analysis will involve some zero damped Thevenin voltages; when only 
inductors appear in some meshes, the node analysis will involve some 
Thevenin currents. 


APPENDIX 
Positive Semidefinite Capacitance Matrices 

When C is only positive semidefinite, one can transform equation (6) 
as follows: The rank m of C is now less than the order n. There exist 
transformation matrices NV; such that 


HCN, = 


R 0 (07) 


0 ¢ 





The zeros represent submatrices of zero elements, bordering the com- 
plete matrix with 2 — m rows and n — m columns of zeros. Then C is 
an m XX m positive definite submatrix. 

These remarks apply to time-varying as well as to fixed circuits. 
Furthermore, the matrix NV; can be so chosen that it is constant, provided 
the time-varying capacitances are always positive (>0, not 20). In 
fact, Ni can be chosen as a matrix of 0’s and 1’s only. A detailed demon- 
stration is beyond the scope of this paper; briefly it derives from the fact 
that the rank of C is determined by the topology of the capacitor part 
of the circuit, without regard to the values of the capacitances (provided 
they remain >0, so that the topology cannot be a function of time). 

Because NV, = 0, Ni‘p = pNy', and (6) can be transformed (as for 
fixed circuits) into 


i= [a +o]) 3| |# (98) 


E = MiF,, t, = N,'T. 
A further transformation, Ne , is now easily found, of the form: 


U Nw 
0 U 


G0 


0 G i?) 











|| 








| Ne U 
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Here U represents a unit, or identity submatrix of appropriate order, 
and Ny is an m X (n — m) submatrix. Because of the zero submatrix 
in the upper right-hand corner of the complete matrix, 


U = Ol|o O|/U Ne 0 0 
Ny» UljO Cllo Oo 0 ¢ 


Thus the capacitance matrix is unaffected by the second transformation. 
The elements of the Ni2 portion of Nz may be time-varying. However, 
because the time-varying elements combine with no nonzero elements of 

C, No'pC = pN2‘C. Thus the second transformation yields 
sae | By. (101) 


, [la 0 
r=| 0 ¢ 


0 G 
We can now divide the components of the transformed current and 
voltage vectors into two parts, as follows: 


























: (100) 











+p 


i, = GE, in n — m dimensions 


a sey ot (102) 
I= (G+ pO)#, in m dimensions. 


The first equation is algebraic, and its diagonalization is routine. The 
second equation is in our standard form (6), with C positive definite, 
and it may be transformed further in accordance with (29) to (36). 

We arrived at this formulation by starting with a one-to-one cor- 
respondence between modes and components of the current and voltage 
vectors, and then transformed the corresponding differential equation. 
Desoer and Paige! arrive at the same conclusion by starting with more 
general choices of the current and voltage components, taking account 
of the circuit topology. 
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Mode Selection in an Aperture-Limited 
Concentric Maser Interferometer 


By TINGYE LI 
(Manuscript received May 22, 1963) 


The concentric interferometer with a limiting aperture in its mid-plane 
ts analyzed for its mode-selective properties. Two of the lowest-order trans- 
verse modes and their losses for the infinite-strip geometry are computed by 
solving the associated integral equations by the method of successive approxt- 
mations. The apertured concentric interferometer 7s found to be more mode- 
selective than the apertureless concentric interferometer or the Fabry-Perot 
interferometer with parallel plane mirrors. Computed results indicate that 
the optimum aperture size for maximum mode selectivity is approximately 
the size of the major lobe of the diffraction pattern of the dominant mode at 
the aperture plane. However, the maximum selectivity attainable does not 
exceed that of the “comparable” confocal system. The latter system ts not 
very ‘practical because tt requires either very long resonator lengths or very 
small mirrors. 


I. INTRODUCTION 


Interferometer-type resonators that are used in optical masers are 
inherently multimode devices. The resonant modes that can exist in 
such devices may be classified as longitudinal and transverse modes. 
The longitudinal mode order is determined by the number of field varia- 
tions along the axis of the interferometer, while the transverse mode 
order is determined by the number of field variations in the plane of 
the mirrors. For each longitudinal mode order, there exists a set of trans- 
verse modes. The number of modes that can partake in the oscillations 
of an optical maser is dependent on the geometry and the losses of the 
resonator, the width of the atomic resonance of the active material and 
the degree of population inversion. Practically, an optical maser will 
oscillate in several modes simultaneously unless special steps are taken 
to suppress the unwanted ones. For applications such as optical com- 
munication it is desirable from the standpoint of noise, coherence, spec- 
tral purity, etc. to suppress all but one mode in a maser oscillator. There- 
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fore, mode-selection schemes are very important if the optical maser is 
to be a useful source of coherent and monochromatic radiation. 

Aside from losses of a random nature such as those due to inhomo- 
geneities in the medium and mirror imperfections, the resonant modes 
in a maser interferometer also suffer from losses due to diffraction around 
the mirrors.!? The diffraction loss varies very little with longitudinal 
mode order, but increases very rapidly with increasing transverse mode 
order. Thus interferometer-type resonators are inherently mode-selec- 
tive with respect to transverse modes. By using a long, thin configura- 
tion (small mirrors and large mirror separation) it is possible to suppress 
all but the dominant transverse mode.’ Also, by operating just above 
the oscillation threshold’ or by using a short resonator® it is possible to 
restrict the oscillations to a single longitudinal mode. The output power 
in these cases is somewhat limited. 

Since it is desirable to pump the active medium strongly so as to ob- 
tain as much power as possible from a maser oscillator, methods for 
providing additional mode selection are important. In general, mode 
selection involves the introduction of loss to the resonator in some pre- 
scribed manner. Kleinman and Kisliuk® have proposed the use of an 
additional I’'abry-Perot interferometer to discriminate against unwanted 
longitudinal modes. Kogelnik and Patel’ obtained essentially a single- 
frequency output from a gaseous maser using three concave mirrors. 
Collins and White’ used two tilted Fabry-Perot etalons within the reso- 
nator of a ruby maser to select longitudinal as well as transverse modes. 
Schemes for suppressing unwanted higher-order transverse modes by 
employing a limiting aperture in the focal plane of a suitable optical 
system within the resonator of a ruby maser were tried by Burch,® Baker 
and Peters,!® and Skinner and Geusic.!! These latter schemes are essen- 
tially equivalent to an interferometer system consisting of a pair of 
spherical mirrors spaced concentrically and having a limiting aperture 
in its mid-plane. Since the field distribution over the mid-plane of a 
concentric system is essentially the far-field pattern (Fourier transform) 
of the field over the mirrors, the higher-order transverse modes will 
have a wider lateral spread than the lower-order ones. Therefore, a suita- 
bly chosen aperture will introduce very little loss to the dominant mode 
but will introduce appreciable loss to the higher-order modes, making 
the apertured system quite mode-selective to transverse orders. 

In order to study the mode-selective property of an apertured con- 
centric system, we have set up the appropriate integral equations for 
the infinite-strip geometry and solved them iteratively on a digital com- 
puter for the intensity distributions and the losses of the two lowest- 
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order transverse modes. The computed results indicate that there is an 
optimum aperture for maximum mode selectivity,* but that this maxi- 
mum never exceeds that of the “comparable”’ confocal system. In fact, 
as the aperture is made smaller and smaller the behavior of the apertured 
concentric system approaches that of the confocal system. However, 
the comparable confocal system is not very practical because it requires 
either very small mirrors or very large mirror spacing. 
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Fig. 1 — Geometry of the aperture-limited concentric maser interferometer. 
(a) Full-concentric system with aperture. (b) Equivalent half-concentric system. 


II. INTEGRAL EQUATIONS OF THE SYSTEM 


Since the full-concentric system with a mid-plane aperture is equiva- 
lent to a half-concentric system with a plane mirror of the same size as 
the aperture,” it is convenient to use the half-concentric model for the 
formulation of the integral equations (Fig. 1). Also, it is convenient to 
use the two-dimensional model of infinite-strip mirrors, since the three- 
dimensional problem of rectangular mirrors can be reduced to a two- 
dimensional one.! The behavior of systems with circular mirrors is ex- 
pected to be similar to that of square-mirror systems. 

Fig. 1(a) shows the geometry of anapertured concentric interferometer. 


* The loss of the higher-order mode relative to the loss of the dominant mode 
may be regarded as a measure of mode selectivity. 
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The infinite-strip concentric mirrors are of width 2a, and are separated 
by b, which is twice the radius of curvature of the mirrors. The aperture 
is of width 2a, and is in the mid-plane of the interferometer. Fig. 1(b) 
shows the equivalent half-concentric model which is to be used for the 
analysis. The integral equations defining the modes can be derived from 
our previous analysis on interferometers with curved mirrors by setting 

= 0 and gz = 1 in equations (11) and (12) of Ref. 12. The resulting 
equations are 


eitl4 ag 


ar L 





yy? (a1) = oP [jk (Qa, — 22") /2b'lW (ae) dx, (1) 
and 


eitl4 


yy? (a2) = a/b [. exp [jk(Qaizz — 22") /2b/WW(a1) dar (2) 


where the subscripts and superscripts one and two denote the curved 
mirror and the plane mirror, respectively, as shown in Fig. 1(b). The 
y’s are the eigenfunctions that describe the relative field distributions 
over the mirrors, and the y’s are the corresponding eigenvalues that 
specify the loss and the phase shift the wave suffers during each transit. 
The propagation constant k is equal to 27/\, where \ is the wavelength 
in the medium. The mirror separation b’ is equal to b/2. 

Equations (1) and (2) are single-transit equations which can be com- 
bined to form two round-trip equations. They are 


(an) = c. Kile, #0 (a) da, (3) 


— 0) 


and 
a2 
yy (22) = / K2(2, %2)~ (He) dae (4) 
Se, 
where the kernels Ay and Ke are given by 


Kats, ts) = 525 [exp Ljkleer +) — 28 }/0lde (5) 


Ko(22, f) = \b! cf exp [jk {a( az + £2) = (a9 + %_") /2}/b'} dx, (6) 


and the eigenvalue y is equal to y® y®. Since one round trip in the 
half-concentric system is equivalent to a single transit in the full-con- 
centric system, (3) may be regarded as the integral equation defining 
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the modes of the aperture-limited concentric interferometer while (4) 
gives the field distribution across the aperture. 

The kernels K, and K, as defined by (5) and (6) are symmetric; that 
is, Ky(a,, %) = Kyi(%, x) and Ko(xe , 2) = Ke(%2, x). Therefore, the 
eigenfunctions y¥, and y,@ corresponding to distinct eigenvalues yn 
are orthogonal in the sense that 


[be Cerdbm (ar) diy = 0, m xen (7) 


| ; Va” (at2)Wm (a2) dxz — 0, men. (8) 


As in the general case of curved mirrors,” the eigenfunctions are com- 
plex and are orthogonal in the non-Hermitian sense. 

The integral equations can be solved numerically using iterative tech- 
niques. However, it is possible to extract the asymptotic behavior of 
the solutions from these equations for very small apertures. Thus for 
(a2/a) K 1 and a.?/bv\ < 1, the terms involving x.” and %,? can be neg- 
lected and the integral equations reduce to those for the asymmetric 
confocal configuration (with unequal mirrors). It has been shown by 
Boyd and Kxogelnik"“ that the modes and the corresponding losses of an 
asymmetric confocal system are the same as those of a symmetric sys- 
tem with equal mirrors of width 2a where a = ~a,a2.. Therefore, the 
behavior of the aperture-limited concentric system approaches asymp- 
totically that of the confocal system as the aperture is made smaller 
and smaller. We designate the confocal system with Fresnel number 
N. = a@2/b’d as the comparable confocal system. 


III. COMPUTED RESULTS AND DISCUSSION 


The two lowest-order modes and their eigenvalues for an aperture- 
limited infinite-strip concentric interferometer were computed using the 
method of successive approximations! on an IBM 7090 computer. One 
hundred increments were used in the numerical integration of (1) and 
(2). Curves for power loss per transit for the two lowest-order modes 
and for different values of Fresnel number (N = a;2/bA) are given in 
Tig. 2. The abscissa is the half width of the aperture normalized to a,/N, 
where both a; and N should be regarded as constants. The dashed curves 
give the loss of the comparable confocal interferometer as functions of 
its half Fresnel number, which is equal to aa2/bA. The losses for the 
limiting case of infinitely wide aperture are given on the column on the 
right side of the figure. 


2614 THE BELL SYSTEM TECHNICAL JOURNAL, NOVEMBER 1963 


a 












































POWER LOSS IN PER CENT 


























0.2 0.3 b 0.4 0.5 INFINITELY 
Na, /a,=4a,a AN=N-./2 WIDE APERTURE 
2/Ay 1 2/ c/ (a2 00) 


Fig. 2 — Power loss per transit for the two lowest-order modes of an aperture- 
limited concentric interferometer. The abscissa is the half width of the aperture 
normalized to ai/N. (Both a; and N should be regarded as constants.) The dashed 
curves are for the ‘‘comparable”’ confocal interferometer. 
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Figs. 8 and 4 show the relative amplitude and phase distributions of 
the field intensity for the two lowest-order modes. Curves A and B are 
for the aperture-limited concentric interferometer with N = 10 and 
a2/a, = 0.0325; A is the field distribution over the mirror while B is the 
field distribution over the aperture. Curve C is for the comparable con- 
focal interferometer with a Fresnel number (N,) of 0.65. Curve D is for 
an apertureless (a2 — ©) concentric interferometer with N = 10 and 
is the same as for the parallel plane configuration except for the reversal 
of sign in the phase distribution.!:? 

The ratio of the loss of TEM, mode to the loss of TEMo mode, which 
is a measure of mode selectivity, is plotted in Fig. 5 as a function of the 
normalized aperture half width. The dotted curve represents the same 
ratio for the comparable confocal system plotted as a function of its 
half I’resnel number, which is equal to aja:/bA. The short slant lines 
represent segments of the loci of constant loss for the TEMo mode. The 
ratio of the losses for the limiting case of infinitely wide aperture is ap- 
proximately four* for large N (N > 1) and is given on the column on 
the right side of the figure. 

That a suitably chosen limiting aperture placed in the mid-plane of 
a concentric interferometer should be mode-selective can be surmised 
by considering the field distribution over its mid-plane. This field dis- 
tribution, as given by (2), 1s, except for a quadratic phase factor, the 
Fourier transform of the mode pattern over the mirrors. It resembles 
very closely the far-field pattern of the equivalent parallel plane sys- 
tem! and therefore has a lateral spread which increases with mode 
order. Consequently, a limiting aperture having the width of the major 
lobe of the dominant mode would have a small effect on that mode but 
would increase quite significantly the losses of the higher-order modes. 
The optimum aperture width (for a given N) corresponding to maxi- 
mum mode selectivity as shown in Tig. 5 is approximately equal to the 
major-lobe width. As the aperture is made larger, more and more of the 
minor lobes are uncovered and they interfere either constructively or 
destructively at the mirrors, producing the oscillations in the loss curves 
in Fig. 2 and in the relative loss curves in Fig. 5. 

Fig. 5 shows that a suitably chosen aperture can increase the relative 
loss of the TEM, mode several times its apertureless value. The effect 
of the aperture on other higher-order modes is expected to be even 
greater. By observing the number of iterations required to produce a 


* For large Fresnel numbers (V > 1) the ratio (loss of TEM, mode)/(loss of 
TEM, mode) is approximately (1 + n)? for parallel plane or concentric configura- 
tions. 


2616 THE BELL SYSTEM TECHNICAL JOURNAL, NOVEMBER 1963 


1.0 
































RELATIVE AMPLITUDE 
°o 
in 


TEM, MODE 






































RELATIVE PHASE IN DEGREES 























oO of 02 03 04 #O5 O06 O7 O8 o9 1.0 
x2/(MIRROR OR APERTURE HALF WIDTH) 


Fig. 3— Relative amplitude and phase distributions of the lowest-order 
(TEM,) mode. Both A and B are for an aperture-limited concentric interferometer 
with N = 10 and a2/ai = 0.0325; A is the field distribution over the mirror and B 
is the field distribution over the aperture. C is for the ‘‘comparable”’ confocal 
interferometer with a Fresnel number of 0.65. D is for a concentric interferometer 
with N = 10 andaz— ». 
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Fig. 4— Relative amplitude and phase distributions of the second lowest- 
order (TEM,) mode. The notation for the curves is the same as for Fig. 3. 


steady-state solution from an arbitrary initial trial function, it is possi- 
ble to infer the relative loss of other higher-order modes. For example, 
the number of iterations required for the dominant (TEMo) mode of 
an aperturcless concentric (or parallel plane) system with N = 10 is 
about 800, whereas with an aperture such that a2/ai = 0.0325 the num- 
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ber required reduces to about 25, which indicates that the relative losses 
of the higher-order even-symmetric modes are now very much higher. 
The behavior of the apertured concentric system was found analyt- 
ically to approach that of the confocal system in the limit of very small 
apertures. The computed results confirm this and show further that the 
mode selectivity (in terms of relative loss) of the apertured concentric 
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Fig. 5 — Ratio of the losses of the two lowest-order modes versus normalized 
aperture half width. The dashed curve is the ratio for the ‘‘eomparable’’ confocal 
interferometer. The short slant lines are segments of the loci of constant loss for 
TEM, mode. 
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system can never be greater than that of the confocal system. However, 
the effective suppression of unwanted higher-order modes in a maser 
oscillator requires not only that the relative losses of the higher-order 
modes be high but also that their absolute losses be greater than the 
gain of the active medium. To satisfy the second condition a confocal 
system would have to operate witha very small I'resnel number, whereas 
a reasonably large I'resnel number can be used for the apertured con- 
centric system. For example, if a maser having a gain of 20 per cent per 
pass is required to produce a single transverse-mode output, an apertured 
concentric configuration (with square mirrors) having N = 20 and 
d2/a, = 0.018 can be used, but a confocal configuration would need 
an N, of about 0.7, which means either very long resonator lengths or 
very small mirrors. Such configurations are generally undesirable be- 
cause lengthening the resonator tends to increase the number of longi- 
tudinal modes that can oscillate and decreasing the size of the mirrors 
tends to diminish the power output capability. 
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Broadband Electro-Optic Traveling-Wave 
Light Modulators 


By M. D1IDOMENICO, JR. and L. K. ANDERSON 
(Manuscript received June 13, 1963) 


A detailed analysis has been carried out of a broadband traveling-wave 
electro-optic light modulator using single crystals of currently available ma- 
terials. The structure analyzed is that first proposed by Rigrod and Kaminow 
in which a light beam is reflected back and forth across a microwave trans- 
mission line with the angle between optical and microwave phase velocity 
vectors chosen so that the component of the optical phase velocity vector in the 
direction of nucrowave propagation is equal to the microwave phase velocity. 
For generality both amplitude and phase modulation are considered in single 
crystals belonging to three different symmetry groups, Ta and Dea(Va), 
(linear electro-optic effect) and O;, (quadratic effect), for two different orienta- 
tions of the modulating field. . 

The inclusion of such practical factors as microwave and optical loss re- 
sults in an optimum design in which the modulator dimensions and operat- 
ing temperature are uniquely determined by the optical and microwave di- 
electric properties of the modulating medium. The main conclusions of the 
analysis are: 

(i) Either cuprous chloride or suitably biased strontium titanate may be 
used to produce 50 per cent linear modulation over a bandwidth of 10 ge 
with less than 5 watts of modulating power, using structures a few centime- 
ters long that have practical manufacturing and operating tolerances. 

(iz) The upper modulation frequency linut ts set by a cutoff frequency 
which arises from the finite width of the optical bean. 

(iit) The presence of appreciable crystalline strain in the modulating 
medium requires that the modulating system consist of a simple phase 
modulator followed by some form of frequency-selective discriminator. Such 
a system requires monochromatic but not necessarily coherent light. 


I, INTRODUCTION 


Broadband microwave modulation of light requires the use of an ex- 
tended traveling-wave structure. In order to obtain a cumulative inter- 
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action between the microwave and optical traveling waves, some form 
of velocity matching is necessary. In a structure of finite length a prac- 
tical velocity matching criterion is that the component of the optical 
phase velocity vector taken along the microwave wave vector be equal 
to the microwave phase velocity. Several possible arrangements imme- 
diately suggest themselves for realizing this type of interaction. For ex- 
ample, we could use true traveling-wave structures!“ or iterative struc- 
tures® where the optical and modulating signal are adjusted to interact 
intermittently with the proper phase. Broadband modulation further re- 
quires that the modulating structure have little dispersion, which im- 
plies propagation ina TEM or TEM-like mode. Thus, since the dielectric 
constants of usefully transparent modulating media are much less at 
optical frequencies than at microwave frequencies, broadband micro- 
wave modulation of light can be achieved by using some form of optical 
slow-wave structure, of which the simplest is that first proposed by 
Rigrod and Kaminow.! In this structure, shown schematically in Fig. 1, 
an optical plane wave zigzags back and forth in an electro-optic medium 
between plane mirrors while the interacting microwave modulating 
signal propagates longitudinally down the structure. The object of this 
paper is to analyze in detail the operating characteristics of this form of 
modulator, to present a systematic and quantitative procedure for de- 
signing modulators that require a minimum of modulating power, and 
finally to present numerical results for a number of presently available 
electro-optic materials. 

The kinds of materials considered here fall into two classes. The first 
of these comprises materials which exhibit a linear electro-optic effect, 
and the second includes materials which possess a quadratic or Kerr 
electro-optic effect. The specific materials considered in the first class are: 

(z) cubic crystals of symmetry 77, such as cuprous chloride (CuCl) 
and zine sulfide (ZnS), 

(it) KDP and its isomorphs of symmetry De(Va). 

The second class of materials consists of centrosymmetric cubic crystals 
of symmetry O; such as the titanates (SrTiO; and BaTiO;) operated in 
their paraelectric phase. 

All electro-optic modulators using transparent media are basically 
phase modulators in that the modulating fields act directly to change 
the optical phase velocity of the medium. The phase modulation intro- 
duced on a light beam can then be used to transmit intelligence, or if 
desired it may be converted into intensity modulation. This latter process 
may in a sense be viewed as a homodyne process® in which we beat two 
phase-modulated normal modes of a birefringent modulating medium 
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Fig. 1 — Basic zigzag modulator configuration (after Rigrod and Kaminow, 
Ref. 1). 


(e.g., an ordinary and extraordinary wave), the mixing taking place in a 
polarization analyzer (e.g., a Nicol prism) following the modulator. Most 
amplitude modulators to date have been of this form. 

A systematic comparison of the suitability of the aforementioned ma- 
terials in a zigzag modulator is complicated by the fact that the optimum 
modulator configuration depends upon the form of modulation desired 
(e.g., AM, PM, polarization modulation, etc.) as well as on the demodu- 
lation system used (e.g., simple quantum counter, heterodyne, etc.). The 
comparison is further complicated by the necessity of taking into account 
such practical problems as crystalline strains, losses (both microwave 
and optical) and various operational and manufacturing tolerances. Of 
these effects the existence of strain, particularly random strain, is of 
paramount importance in determining the ultimate form the modulator 
will take. The effect of random strain is to partially destroy the spatial 
coherence of the modulated wave without affecting the temporal co- 
herence. As a result the performance of any modulation-detection system 
which involves the mixing of two optical signals will be degraded, and 
we show in fact that the only system largely unaffected by strain is a 
phase modulator followed by an optical discriminator and quantum 
counter, / . 

The organization of the remainder of the paper is as follows: In Sec- 
tion II we consider in general the basic phase modulation process in the 
zigzag modulator, obtaining expressions for the time-varying optical 
phase shifts which include the effects of microwave loss and lack of syn- 
chronism between optical and microwave signals. In Section IIT we use 
these results to obtain expressions for the modulation sensitivity of prac- 
tical phase modulators. In Section IV the same thing is done for ampli- 
tude modulators. The practical implications of such effects as diffraction, 
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strain, optical losses, etc. are discussed in Section V. Finally, in Section 
VI we outline a procedure for selecting the optimum modulator dimen- 
sions and operating temperature so as to obtain the most efficient trans- 
fer of information onto the optical beam. In this same section we con- 
sider the question of modulator bandwidth. 

We conclude with a tabulation of the performance to be expected from 
presently available materials in the zigzag modulator configuration. Our 
results show that it should be possible to build a modulator using either 
the quadratic effect in strontium titanate or the linear effect in cuprous 
chloride that will provide 50 per cent linear modulation over a 10 ge 
bandwidth for modulating powers less than about 5 watts. Two factors 
contribute to the high modulation sensitivity in strontium titanate. 
These are: (7) use of a dc bias results in a large effective microwave elec- 
tro-optic coefficient, and (iz) a long optical path is obtained in a small 
volume of material, resulting in a very efficient use of microwave power. 


II. GENERAL THEORY 


The configuration analyzed is shown in Fig. 1. The light is assumed 
to propagate in the yz plane, between mirrors located in the zy plane, 
and to have its electric vector polarized either perpendicular to or parallel 
to the yz plane. The modulating wave propagates in the y direction, along 
the axis of the modulator.* We may distinguish two cases. In the first 
(see Fig. 2a) the electric vector of the modulating field lies in the optical 
plane of incidence (this we call the 0° modulator). In the second (see 
Fig. 2b) the modulating electric field is perpendicular to the optical plane 
of incidence (90° modulator). A TEM modulating wave can be propa- 
gated in the medium with either polarity if the mirrors are made of non- 
conducting dielectric multilayers. 

In the presence of the modulating field and any dc bias, the optical 
index ellipsoid’ is assumed to take the form shown in Fig. 3(a),} with 
the crystalline axes of the material aligned so that the principal axes of 
the ellipsoid coincide with the coordinate axes.{ The significance of the 

* For convenience, we choose to take the xz plane as the transverse plane of the 
modulator and the y axis as the direction of propagation or modulator axis. By 
aligning this Cartesian system with the principal axes of the index ellipsoid of the 
modulating medium, it can be shown that the only effect of changing the direction 
of propagation from, say, the y to the z axis is to cause a sign reversal in the re- 
tardation of the optical wave. 

+t By this assumption, we exclude cases in which the electro-optic effect takes 
the form of a field-dependent orientation of the index ellipsoid. It can be shown 
that this mode of operation is much less effective than the one in which the ori- 
entation of the index ellipsoid remains fixed, and only the lengths of the axes of 
the ellipsoid change. 


t The crystal cut necessary to achieve this alignment for each material is given 
in Appendix A. 
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Fig. 2 — Relative orientation of modulating electric field and optical plane of 
incidence in the two zigzag modulator configurations: (a) 0° modulator; (b) 90° 
modulator. 


index ellipsoid is that it defines the wave vectors and planes of polariza- 
tion of the two normal modes which can propagate through the medium 
in any given direction. A plane perpendicular to the wave vector pass- 
ing through the center of the ellipsoid cuts it in an ellipse whose major 
and minor axes define the directions of the two orthogonal planes of 
polarization. The lengths of these axes are equal to the reciprocals of 
the two indices of refraction. In the configuration shown in Fig. 1, the 
two normal modes have their electric vectors polarized respectively 
parallel and perpendicular to the plane of incidence. The two indices 
of refraction we denote by n, and n, ; from Fig. 3(b) it is evident that 


1/n,’ = 1/n. (la) 
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and 


1 cos? 6; . sin? 6; 
a a 2 i 2° (1b) 
Ny Ny Nz 





As an optical plane wave traverses the modulating medium, its phase 
velocity varies continuously with time. A convenient way to deduce the 
time dependence of the over-all phase shift, and henee the phase modu- 
lation, is to follow the trajectory of a fixed point on the wave front as it 
moves through the medium. This is similar to the method of character- 
istics used to treat nonlinear and parametric wave propagation problems 
and is valid whenever the phase of the wave or the eikonal of geometrical 
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Fig. 3 — (a) Orientation of index ellipsoid with respect to coordinates of modu- 
lator; (b) projection of index ellipsoid on optical plane of incidence. 


LIGHT MODULATORS 2627 


optics is large. When this condition is satisfied the wave propagates 
approximately as a plane wave. This can be demonstrated by expand- 
ing the eikonal in a power series over small space regions and time inter- 
vals. It is then correct to say that the time taken for a point on the 
wavefront to advance a distance y along the modulator axis is given by 





noy 
a Gys c sin 0; (2) 
where c is the velocity of light in vacuo, mo is the refractive index in the 
absence of any electric field, f is the starting time at y = 0, and 0; is 
the angle between the light ray and the z axis. This simple expression 
replaces a complicated integral equation and is correct to first order in 
the fractional change An/n in the index of refraction, and hence is valid 
only when | An/n | < 1, as is usually the case. 
The value of the wave phase at any point in space is given by the 
integral 


- $2 dt 
es ef, Up(E) 


where w/27 is the optical frequency, and ¢ is measured in the ray direc- 
tion. This equation can be written in terms of the component of the 
phase velocity vector in the y-direction* as 


e 8S 
Sue dy Qa | 
oe oy 7 Vy(yt) do Sin 6; Yo nyt) dy (3) 


where Apo is the free-space optical wavelength and S the modulator length. 
Equation (3) applies separately to both normal modes of the electro- 
optic medium when the appropriate index of refraction n is used. How 
nm depends on the applied electric field for the various materials and 
configurations of interest is derived in Appendix A. 

By substituting (2) into (3) and utilizing the expressions derived in 
Appendix A for n(y,t), one can calculate the time dependence of the 
wave phase correct to first order in the fractional change An/n. As long 
as the modulator length is bounded, as it must be if diffraction effects, 
etc., are to be minimized (see Section V), this calculation should give 
accurate results since | An/n | < 1. 

In the remainder of this section we make use of (2) and (3) to obtain 
expressions for the wave phase as a function of time for the two normal 
modes of the modulating medium. These expressions describe the basic 

* Note that when dispersion can be neglected the y component of the optical 


phase velocity vector is just the group velocity of the optical energy flow along 
the modulator axis. 
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modulation process within the medium. In subsequent sections we shall 
go on to describe the practical considerations involved in the actual 
realization of phase and amplitude modulators. 


2.1 Linear Electro-Optic Effect in Cubic Materials 


2.1.1 0° Modulator 


For this case the two modes have refractive indices given by (1) and 
(104) 


1 1 1 

ne ee no (1 + no Tulz)* (4) 
1 1 i 

ny = wis (1 — norul, cos’ 0;)' (5) 


where 74, is the linear electro-optic coefficient (see Appendix A). For an 
attenuated modulating wave of the form 


E.(y,t) = Ene °™ cos (amt — Bn) (G6) 


the phase of the wave polarized perpendicular to the plane of incidence 
is approximately given by 








¢ (to) 

2 : = 
~ aa ee | {1 — Ano TiLm € me cos [wmto + (Be = Bm) yl} dy (7) 

No sin 4; Jo 
where for convenience we have introduced the notation 

Nom 
uss = £B.. (8) 
c sin 6; 


The approximation holds when | noralim | K 1. Equation (7) can be 
integrated directly, giving 


Qn No _ on no 





eat — Am S8/2 
elt) = No sind; Ao sin 0; alin € 
sinh? 2 ams + sin’ ey (9) 
x = eee cos (wml + y) 


where 8 is defined by 
B = Be ae Bm (10) 
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and yw is a time-independent phase factor. In the absence of microwave 
losses this reduces to the result obtained earlier by Kaminow and Liu.” 
It is evident that the phase modulation described by (9) is maximized 
by the choice 8 = B — Bm = 0, corresponding to the matching of the 
y-components of the microwave and optical phase velocity vectors. 
Thus, it is clear that microwave losses do not affect the basic velocity 
matching condition. Under this condition (9) reduces to 

Qa Nos 


ans 
e,(t) No sin 6; 





[1 — Ano raEmf(amS/2) COs wn] (11) 


where for the sake of brevity we have defined the function f(a,S/2) by 


(RIC). 


For the other wave, whose plane of polarization lies in the plane of in- 
cidence, the corresponding expression is 


2m _noS E + 4noTuln cos’ 6:f (:=) cos ont | . (18) 


y(t) = x ae 





2.1.2 90° Modulator 


The analysis for the 90° modulator is formally identical, except that 
(105) is used for the principal indices of refraction, in place of (104), so 
that 


1/ny = 1/ng (14a) 
and 
en = = (1 — nm ru, cos 26;)'. (14b) 
Ny = No 


The results for a modulating wave of the form given by (6) are 


27 nS 
t —>— —_—, z = 
p(t) i ane constant (15) 





and 





= 2a NoyS i, 2 ; “a8 
g(t) = ney E + 4no rum cos 26;f (238 COS mt |. (16) 


2.2 Quadratic Electro-Optic Effect in Cubic and/or Amorphous Materials 


The analysis for the quadratic effect in the presence of a large dc bias 
is very similar to that for the linear effect. The situation is complicated 
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somewhat by the presence of two independent electro-optic coefficients 
in the case of the quadratic effect in cubic crystals (see Appendix A). 
We consider the two configurations discussed before, in which the modu- 
lating field lies parallel (0° modulator) or perpendicular (90° modulator) 
to the optical plane of incidence. 


2.2.1 0° Modulator 


For this case the relevant refractive indices are [refer to (110)] 


vey ~ (1 + ngool2)* (17) 


Na 
and 
tod : 
a= 2 [1+ nib? (sin 6; + © cos’ a) | (18) 


Ny 


where p; and p2 are quadratic electro-optic coefficients (see Appendix A). 
In order to achieve approximately linear modulation, as well as to en- 
hance the modulation sensitivity, the applied field should consist of a 
large static bias and a smaller microwave field, so that 


E.(y,t) = Eo + Eme°"" cos (wml — Bm). (19) 
Then to first order in L»n/Eo 
EP & Ey + 2EpE ne °"" cos (amt — Bmy) 


and the expressions for 1/n, and 1/n,; become linear in L’,, . The inte- 
grations needed to obtain ¢, and g) are similar to those in the preceding 
section. Again the maximum ac phase shift occurs with velocity match- 
ing, for which case we obtain 


o.(t) = 2% 208 {1 2 iy als E + 2Enf (e=8) cos ont |} (20) 











No sin 6; 
and 
g(t) = en te ‘ — 4no pill (sin 6: + = cos” s) 
Ao Sin 4; pi 


(21) 


~~ | + 2k nf (225) cos ont |} . 


In deriving (20) and (21) we note that because the de bias makes the 
material birefringent even in the absence of a microwave modulating 
field, the ordinary (_L) and extraordinary (||) modes cannot, strictly 
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speaking, be simultaneously matched to the modulating wave. We as- 
sume, however, that the bias is small (| 0 plo | «K 1) so that the dis- 
tinction between ordinary and extraordinary phase velocities becomes 
unimportant. For known materials this condition is met for fields as 
large as 10° v/em—i.e., for any field that the material can sustain 
without dielectric breakdown. 


2.2.2 90° Modulator 


The analysis here is similar to the 0° case, except that the indices of 
refraction are simply [refer to (111)] 


1 1 1 

he = ie (1 + no pill,’)? (22a) 
and 

1 1 2 2\i 

mS Pa! + no poli, ) (22b) 


so that the resulting wave phases are 


she Qn nS (1 ee en E + 2Ein f (2x5) cos ont | (23) 








Xo sin 6; 
and 
Qa NoS Om, 
=a ate 2 
Y | (t Xo an 0; {1 3N0 FoahG | + 2B nf (c=8 ‘) COS ont} - ( 4) 


2.3 Linear Electro-Optic Effect in Uniaxial Materials 
The results here are similar to those obtained earlier for the effect 


in cubic materials. 


2.3.1 0° Modulator 


Iquations (1) and (113) apply here, so that 


1 
rues om Fe + noresLz)* 

i 

1 1 2A 3 20) 
et — no rel. Cos’ 6; aoe" aie 6) 

NI No No ; 


where we have set An = n. — m, which we assume to be much less 
than unity (n,. refers to the extraordinary refractive index). The wave 
phases for the two modes are then given by 


2632 THE BELL SYSTEM TECHNICAL JOURNAL, NOVEMBER 1963 





g(t) = 2m noS E + Ano Teslimf (2) cos ont (26) 











Ao sin 6; 
and 
27 NoS 
i)=—-- 
yy (t) ae 
2A S 22 
x E = Sein” 6; — Ano TesLm COs’ 6; f (25°) cos ont | : 
No 2 
2.3.2 90° Modulator 
From (1) and (114) we obtain 
I/n, = 1/n. 
and 
D> (1 = tlre 008 0) 28) 
n| = No — No e3l42 COS G; ( 
so that the wave phases are 
Qr NeS = 
ot) = weary ta constant (29) 
and 
= 2a noS 402 : Om 
g(t) = ere E 4noTeslim COS 26; f (ex) Cos ont . (30) 


III. PHASE MODULATORS 


The preceding results are immediately applicable to the phase modu- 
lator. The maximum modulation sensitivity is obtained when the beam 
is linearly polarized in the direction (_L or ||) showing the greatest effect, 
although unpolarized light can also be used. 

For all the cases discussed in the previous section, the ac component 
of the wave phase can be written in the form 





3 
g(t) = “as ASg(6:) f (2=*) Em COS ont, (31) 
0 


where A is proportional to electro-optic coefficient and g(6;) depends 
on the angle of incidence. The quantity of most immediate interest is 
the modulation sensitivity, which we may define as the ratio of the 
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peak phase shift to the square root of the power input to the modulator.* 
To make the problem definite, we consider a parallel-plate line of spac- 
ing h and width w, filled with a medium of microwave dielectric constant 
x’. If we ignore fringing fields,t the power flow is given in terms of the 
electric field by 


p= ED aw, (32) 





where 7 = 3877 ohms is the impedance of free space. The modulation 
sensitivity is then 


M _ Qano F AmS 2n 
4-22 awn (Fg 


where JM is the peak ac phase shift. Since the angle of incidence is deter- 
mined in terms of material parameters by the velocity matching condi- 
tion 8. = Bm, which reduces to sin 6; = o/~/«’ for the present geometry, 
(33) can also be written as 


M | 2xn’ No Ams 2Qn 
ela aCe) me 
where p(1/*/x’) is a function of (no/*~/x’) which replaces g(6;). Values 
of A, g(@;) and p(m/~/x’) for the cases of interest here are given in 
Table I. In this table, the linear electro-optic coefficient which we write 
simply as r;; is to be taken to be ra in the case of cubic materials, and 
fe3 for uniaxial crystals. 
Numerical results for the power required in practical cases are given 
in Section VII. 














IV. AMPLITUDE MODULATORS 


In this section we treat the problem of obtaining broadband ampli- 
tude-modulated light by suitably combining the two simultaneously 
phase-modulated normal modes of a transparent electro-optic medium. 
In the ensuing treatment we will consider only the intensity variations 
in the modulated beam. In so doing we are ignoring the phase informa- 
tion that would be recovered, for example, by an optical heterodyne re- 


* The advantage of defining a modulation sensitivity in this way is that it is a 
function of the microwave-optical properties of the electro-optic modulating me- 
dium and is expressed in terms of power which is easily measured. 

} The effects of fringing fields and the problems associated with non-TEM mode 
propagation in this structure are considered in Section 5.1. 
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Tasie I — Funcrions APPEARING IN (33) AND (34) FoR 
Various PHase MopuLtator CONFIGURATIONS 









































Modulator Mode A g(83) p(n0//x") 
Linear, 0° 1 dry 1/sin 6; /x'/n0 
! : cos? 0; a/x! 1 no? 
ays sin 6; no Kk’ 
Linear, 90° als 0 = — 
; cos 26; /x! Qno? 
I aris : eae dae he 
sin 6; No K 
Quadratic, 0° L polo 1/sin 6; V/«'/N 
. Ne No 
sin? 6; + = cos? 6: (2 
Fz p1 K pi K 
cr sin 6; No 
Vx! 
Quadratic, 90° ab pilio ara 7 
| polo 1/sin 0; / Kk /no 








ceiver, and in effect are assuming that the receiver is a simple quantum 
counter. Thus we look for modulation of the form 


I(t) = Io(1 + M cos wnt) (35) 


where Jy is the average light intensity and M is the modulation index. 

Cubic materials are of particular interest as amplitude modulators 
since they are isotropic in the absence of an applied field and become 
uniaxial only upon application of a field. This is in contrast to what 
takes place in materials with lower degrees of symmetry where naturally 
occurring birefringence exists. The significance of starting with an iso- 
tropic crystal, as opposed to a uniaxial or even biaxial crystal, is that in 
the former no static retardation is accumulated between the two normal 
modes. Hence any ac difference in phase produced between these modes 
can easily be converted into useful intensity modulation. The presence 
of a static contribution to the differential phase shift can cause the in- 
duced ac birefringence to be masked out due to the angular spread of 
the beam, constructional uncertainties or finite source linewidth. (See 
Section 4.3.) 

There are two simple ways in which a time-varying phase shift be- 
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tween two orthogonally polarized light beams may be converted into 
approximately linear intensity modulation. One of these is most useful 
when there is no de retardation between the two beams, and the other 
when there is a controllable de retardation, as in the biased quadratic 
effect modulator. 


4.1 Amplitude Modulation Using the Linear Electro-Optic Effect in Cubic 
Materials 


The first method of producing amplitude modulation is shown in 
Fig. 4. Circularly polarized light is produced by passing the linearly 
polarized incident beam through a quarter-wave plate in the manner 
shown. 

The light emerging from the quarter-wave plate may be decomposed 
into two orthogonal normal modes, one aligned with the fast or ¢ axis 
of the quarter-wave plate and the other aligned with the slow or & axis 
(see Fig. 4). Both normal modes are of equal amplitude but separated 
in phase by 90°. In order to obtain the desired modulation, the plane 
of transmission of the analyzer must make an angle of 45° with the plane 
of incidence. 

For the arrangement given in Fig. 4 one can easily show that the ratio 
of transmitted to incident light intensity is given by 


T,/Ty = 3[1 + sin (gy, — @)]. (36) 


E, 





FAST 1 
st (6) PRISM 


QUARTER-WAVE MODULATOR ANALYZER 
PLATE CRYSTAL 


POLARIZER ~ ax 


Fig. 4 — Arrangement for obtaining linear amplitude modulation for cubic 
crystals exhibiting a linear electro-optic effect. This arrangement can also be used 
with uniaxial crystals if the quarter-wave plate is replaced by a suitable optical 
compensator. 
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For the modulators using the linear electro-optic effect in cubic crystals, 
(11), (13), (15), and (16) yield 





3 
Ore = HE BP albaF (28) COS Wmt (37) 
do 2 
where 
2 
ea (0° modulator) 
sin 6; 
g(9i) = aa (38) 
il (90° modulator). 
sin 6; 


To obtain a specific expression for the modulation sensitivity, which we 
define as | M/-~/P | as before, we may again consider the simple parallel 
plate geometry discussed in Section III. It is evident from (36) and (37) 
that the modulated signal will in general consist of an infinite number of 
sidebands whose amplitudes are determined by Bessel functions. For 
simplicity we limit ourselves to the case of small modulation index (viz., 
les — | < 7/6) so that only the first sideband is important. We 
then obtain for the modulation sensitivity 





with 
2—- no? /K’ 
iE i Se ° 1 
(2%) 7 mie (0° modulator) i 
SAVE) © A = 2net/e (90° modulator) 
Ve modulator). 


In this expression we have again used the velocity matching condition 
sin 6; = no/~V/«’ to replace g(6;) by p(no/Wx’), and have replaced the 
electro-optic coefficient with the more generally useful half-wave retar- 
dation voltage, V; , defined by* 


_ _ Yo 
2nePra 


V; (41) 


Once again we defer computing actual numbers until Section VII. 


* Note that when applied to the transverse electro-optic effect V; need not be 
the actual voltage applied to the crystal to obtain 7 phase retardation, which de- 
pends on the geometry of the crystal, but is to be viewed as a convenient collection 
of constants. 
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4.2 Amplitude Modulation Using the Quadratic Electro-Optic Effect 


Use of the quadratic effect in conjunction with a de bias to provide 
amplitude modulation differs from the linear case just considered in 
that there is now a bias-field dependent static phase difference produced 
between the ordinary and extraordinary rays. Approximately linear 
intensity modulation can be obtained in this case by placing the active 
medium between crossed polarizer and analyzer, as shown in Fig. 5, 
provided that the dc bias is adjusted so that operation occurs about the 
point of half-transmission. For the arrangement of Fig. 5, the ratio of 
transmitted to incident light intensity is given by 


I./Ty = 3[1 + cos (¢1 — ¢)]. (42) 
According to (20), (21), (23), and (24), the phase difference for the 
quadratic effect can be written as 


3 
Len — 1] = Cnn — m2) BoSg (0. E + 2Baf (228) cos ont | (48) 


where 
sin 6; (0° modulator) 
g(9:) = (44) 


1/sin 0; (90° modulator). 


Equation (43) is in the form 9, — 9) = Yac + Yac 3 hence (42) can be 
written: 


I./I = 3[1 + COS Yde COS Yae — sin Yde sin Yacl- (45) 


Z Z 
45° >I |. 45° 
\ ¥ f) E ' “%. 
Kh Wash 
bs 
—pP 





YZ Et 
Y y > 45° 
d \ = 
_ — 
Ko 
\ PRISM ‘SPRISM / 


POLARIZER MODULATOR ANALYZER 
CRYSTAL 


Fig. 5 — Arrangement for obtaining linear amplitude modulation in materials 
exhibiting a quadratic electro-optic effect. 
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For Yac < 7/6 (small modulation) this becomes, approximately 

Tif/lp 401 — Yac sin Yae)- (46) 
Maximum modulation sensitivity results when the operating point is 
chosen so that* 


ano (pr = p2) By Sg (;) = 
Ydae = Xo ~ 


The point that should be noted here is that for the linear Kerr effect 
modulator there are modes of operation and that these modes, which 
are designated by the index m, constitute an independent operating 
variable. 

For the operating points defined by (47), the modulation sensitivity 
for the parallel-plate geometry can be written in the form 





(2m +1)5, m = 0,1,2,---. (47) 





where 


No 4 ; 
( .: ) PU, (0° modulator) ' 
PVE) Vf ” 


a (90° modulator). 
0 





In deriving this equation | we have, as before, made use of the synchronism 
condition sin 0; = o/*~/x’, and have replaced the electro-optic coefficient 
(p1 — pe) by the more familiar Kerr constant 


Kz no'(or — p2) (50) 
2)o ; 


It should be noted that the de field is not an independent variable, but, 
because it determines the operating point [see (47)], is limited to the 
discrete values 


| ED 
V ¥5°(%) 


In practice Hy) would be made as large as the dielectric strength of the 
material would permit, subject to (50). We should point out that the 


Ey = (51) 


Nol Re 


* Tt is obvious that the arrangement shown in Fig. 4 could also be utilized here 
if desired; the only difference would be that the quiescent operating point would 
be chosen to make the de part of ¢, — ¢, in (43) an odd multiple of 7. 
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effects of electrostriction at high de fields will undoubtedly result in 
electrostrictively induced birefringence. To the extent that the electro- 
striction is uniform, it is equivalent to a change in the Kerr constant, 
and can be compensated for by tuning Hy). On the other hand, nonuni- 
form electrostriction can degrade the modulator performance in the 
same way as nonuniform strain. (See Section 5.6.) 


4.3 Amplitude Modulation Using the Transverse Linear Electro-Optic 
Effect in Uniaxial Crystals 


Use of the zigzag configuration to produce amplitude modulation 
with noncubic materials is complicated somewhat by the presence of a 
large static phase retardation due to the natural birefringence of the 
material. When the modulator is used with well-collimated monochro- 
matic light, however, the effects of the natural birefringence can in 
principle be eliminated by means of an optical compensator, thereby 
making it worthwhile to consider this case as well. 

The physical arrangement used in conjunction with the linear effect 
in cubic crystals, and shown in Fig. 4, is also applicable here if we re- 
place the quarter-wave plate by a suitable compensator.” The compen- 
sator allows the phase of one optical mode of the modulator to be varied 
relative to that of the other. 

For this arrangement the ratio of transmitted to incident intensity 
is given by 


I,/Iy = 3[1 + cos (9, — ¢ + ¢e)] (52) 


where ¢, is the relative phase shift introduced by the compensator. The 
phase retardation in the modulator is given by (26) to (30), and has 
the form 


3 
i eee 2 a Tors 189(05)j (2°) COS (wmt) + 9s (53) 


where g(6;) is given by (88) of Section 4.2 and g, is the static retarda- 
tion, given by 





= oe sin 6; (0° modulator) 
0 


Ps = (54) 


T aus ae (90° modulator). 
Ao sin 6; 


It is evident from (52) and (53) that maximum linear amplitude modu- 
lation is obtained when the compensator phase shift is adjusted so that 


Set Gs = (2k oe 1)x/2, k= 0, 1, 2, 3, baa | (55) 
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for which case (52) becomes 
Ti/To = $1 — (—1)®™ sin (94 — ¢y)ecl- (56) 


With this adjustment of ¢, , uniaxial and cubic materials are optically 
equivalent as far as modulator operation is concerned [compare (53) 
and (56) with (36) and (37)], so that (39) to (41) for the modulation 
sensitivity can be applied here as well by changing 14; to 1.3 . 

The question arises, of course, as to the practicality of compensating 
for the natural birefringence over a finite aperture; if the modulator is 
to function properly, the static retardation ¢, must be the same, within 
a fraction of a radian, for all rays comprising the beam. Three factors 
are involved here: 

(t) monochromaticity of the optical source, 
(77) dimensional tolerances, and 

(777) angular spread in the incident beam. 

To illustrate the problem we consider the 0° modulator in KDP. In this 
configuration the static retardation is given by 


¢g; = 4r a sin 6; . (57) 
0 


If we neglect dispersion in An, a range of wavelengths Ado in the source 
leads to a spread in ¢, of 














AX 
| Ay, | & ¢s - (58) 
0 
while a variation AS in the modulator length results in a spread 
AS 
Ags © @s s° (59) 


Under typical circumstances (An = 0.04, S = 4em, 6; = 19°, \y = 0.6 n; 
see Section VII) ¢, is of order 10° radians, so that the requirement 
Ay, < 1 implies that the source monochromaticity and modulator di- 
mensions be maintained within a few parts in 10°. Although the fore- 
going tolerances are tight, they are attainable. 

A more fundamental problem concerns the angular spread in the in- 
cident beam as it traverses the modulating medium. For example, let us 
suppose that the beam divergence can be characterized by a spread A6; 
in the angle of incidence. This will produce a variation in ¢, given by 


Ans 


Ag; & 4a x cos 6;A6; (60) 





C 
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so that the inequality Ay, < 1 requires that 


Ag 


Ad; —____—__—-. , 
= 47rAnS cos 6; 


(61) 
For the example presently under consideration, (61) gives Aé; < 3.2 X 
10 ° radian = 6.5 sec of arc. Whether or not this requirement can be met 
depends on the details of the optical system. If, for example, the modula- 
tor were operated in the far field of an optical maser of output aperture 
D which is smaller than the modulator aperture, diffraction alone would 
produce a spread Aé; & 1.22d)/D, so that the inequality could be satisfied 
only if the beam diameter were greater than 2.3 cm, which is clearly 
unreasonable. On the other hand, if the modulator were placed in the 
near field of the optical maser, (which extends for a distance D’/Ay , 
which is of order 1 m for a 1-mm diameter beam), diffraction could be 
negligible, and strain refraction would then set a lower limit to the mini- 
mum realizable effective A@; , which would in all likelihood still be con- 
siderably in excess of the limit set by (61). 

In view of the foregoing practical considerations it would seem un- 
reasonable to attempt the construction of an amplitude modulator in 
the zigzag configuration using a material such as KDP. Moreover, as 
will be shown in connection with the KDP phase modulator, this ma- 
terial typically requires at least an order of magnitude more power than 
other materials under consideration. 


4.4 Summary of Results 


The results for the various forms of amplitude modulator may be sum- 
marized in a single equation, of the same form as that obtained for the 


phase modulators 
Qano° ( No ) (228) 2n 
= — AS = — = : 62 


M 
Values of A and p(n/+/x’) are given in Table ITI. 

















VP 


V. FACTORS AFFECTING PRACTICAL MODULATOR PERFORMANCE 


It is appropriate before discussing the actual design and performance 
of zigzag modulators to consider some of the practical factors which 
affect the operation of the device. In particular we consider the effects of 

(z) fringing fields 
(iz) deviations from the correct angle of incidence 
(iz) diffraction 
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TABLE II] — Functions APPEARING IN (62) FoR VARIOUS 
AMPLITUDE MopuLatTor CONFIGURATIONS 



































Modulator A g(6:) b(no/V'x') 
29. = 2/,/ 
Linear, 0° Brij a S — 
; pet sin 6; no/V/x' 
Quan Ge, ° (p1 — p2) Eo sin 0; C Sageie 7 
Guadentio: 90° Co eats — p2)Eo 1/sin i a /'x'/N 7 





(tv) optical losses 
(v) lack of parallelism in the reflecting surfaces, and 
(vt) strain. 


5.1 lffect of Fringing Fields 


The calculations made thus far are strictly correct only in the limit of 
a uniform microwave field across the active cross section of the modula- 
tor. Whether this condition is satisfied or not depends upon the ratio of 
the width of the electro-optic modulating medium, w, to the width, H, 
of the parallel plate conductor (see Fig. 6). This ratio, given by 


E = w/H (63) 


determines the filling factor of the dielectric loaded parallel plate guide. 
It can be shown from the analysis of Kaminow and Liu’ that when é = 1 
the field distribution in the modulating medium is uniform and propa- 
gates as a TEM wave. This result is, however, valid only when open- 
circuit boundary conditions prevail at the ends of the waveguide cross 
section.” These boundary conditions will apply physically only to the 
extent that fringing fields can be neglected. This condition can be ex- 
pected to be satisfied when the change in transverse impedance in going 
from the dielectric region to air is large, or alternatively, when the energy 
stored in the dielectric is large compared to the energy stored in the 
fringing fields. As long as the dielectric constant of the electro-optic 
medium is in excess of 10, fringing fields should be negligible and there- 
fore have relatively little effect on the foregoing calculations. 

It is important to note that it is generally difficult to propagate TEM 
or TEM-like waves over a broad band of frequencies in partially loaded 
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two-conductor transmission lines of the form shown in Fig. 6. This is 
because dielectric waveguide modes can be supported in such structures 
at high frequencies. The relative importance of the dielectric modes de- 
pends on the filling factor &. For filling factors less than unity, three 
propagating regions can be distinguished.” The first occurs at low fre- 
quencies where 


Am > Wx’. (64) 


In this region the field distribution in the dielectric region is uniform and 
TIEM-like mode propagation results. If, on the other hand, we have 


Am K Weiw (65) 


then, as shown in Ref. 2, the microwave energy resides primarily within 
the dielectric with a cosinusoidal field distribution, and once again 
propagates as a TEM-like wave. For those cases where \n 4/x'w we 
find that a symmetrical TE,-like waveguide mode propagates with the 
dispersion characteristic of such a mode. 

What is of primary significance here is that for values of the filling 
factor between 


One S41 


the dispersion in the microwave phase velocity is small (from the calcula- 
tions given in Ref. 2 a 30-40 per cent variation in phase velocity results 
for € = 0.5). Moreover, as the filling factor approaches unity, not only 
does the dispersion vanish but the field distribution also becomes uni- 
form. Based upon these considerations it is evident that a parallel plate 
guide completely filled with a moderately high dielectric constant elec- 
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Fig. 6 — Representative microwave electric field distribution in the transverse 
plane of the modulating medium. 
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tro-optic material is a suitable practical broadband microwave guiding 
structure. 


5.2 Sensitivity of the Modulator Performance to Small Changes in the Angle 
of Incidence 


The velocity matching condition requires that the angle of incidence 
be adjusted so that sin 0; = 1o/ +~/x'. To determine the effect of small 
deviations from this value, we return to the general expressions for the 
basic phase modulation processes [(9) and following] and observe that 
in the lossless case (@m = 0) the modulation sensitivity is always of the 
form 

sin BS 


| = Cg(6) aa (66) 


where g(6;) is a slowly varying function of 6;, and the parameter 8 is 
given by [see (8) and (10)] 


8 = fn (VE 1). 


sin 6; 





Neglecting any variation of g(6;), we find that a variation in 6; of 


dm No ne = 

| Aé;| © 06> {1 -—;| (67) 
Sk K 

from the value for perfect velocity matching will reduce the modulation 

sensitivity by a factor of 2. 


5.3 Diffraction Effects 


Diffraction can limit the performance of the zigzag modulator by pro- 
ducing a divergence in the beam as it passes through the modulator. This 
is equivalent to a spread in the angle of incidence which, for satisfactory 
modulator operation, must be less than the value of A@; given by (67). 
Whether or not this requirement can be met depends primarily on 
whether the modulator is operating in the far or near field of the optical 
source. The near field of an aperture of diameter D, illuminated by a 
coherent plane wave extends a distance of order D,’/do . In this region, 
which is about 1 m for a 1-mm diameter beam, there is little spread. At 
distances appreciably greater than this, the beam propagates according 
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to the simple theory of Fraunhofer diffraction, with the main lobe having 
an angular width” 


1.220 
Ds ~ 


Thus, in the worst possible case diffraction could result in an effective 
spread in the angle of incidence of order 1.22\)/D, , and hence limit the 
allowable value of the ratio of modulator length to beamwidth, accord- 
ing to (67), to 


(68) 


66 





S I Mn No no 4 
a a este 

Da ~ 2 Xo x’ (1 x’ (69) 
The aperture D, is further restricted, in terms of the geometry of the 
modulator, by the requirement that the beam remain in the modulating 
medium after the first bounce. From Fig. | it is evident that this requires 
that 


Dz < 2d sin Be, (70) 


where d = h in the 0° modulator, and d = w in the 90° modulator. 
Equations (69) and (70) together require that 
S Am No 1 
i a he 
d = Xo Ki (1 —s no?/k’)* (71) 


5.4 Effect of Optical Losses 


We shall distinguish between two kinds of optical loss. The first arises 
from scattering due to crystalline inhomogeneities and absorption within 
the bulk material, and is described by an absorption coefficient as . The 
second is the reflection loss at the mirrors, which are assumed to have a 
reflectivity R. The optical transmissivity can then be written as 


I/Ty = exp [— agS ese 6; + (S/d) cot 6; In R] (72) 


when (S/d) cot 6;, the number of bounces, is large. The relative im- 
portance of reflection and absorption losses can be deduced by looking 
at the ratio, x, of the quantities in the exponent in (72), i.e. 


yds pas Opin: (73) 
agd 


For x > 1 reflection losses dominate, whereas for x < 1 the absorption 
losses are relatively more important. 
If we require a certain minimum transmissivity, (72) and (73) fix an 
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upper limit to the linear dimensions of the modulator in terms of a loss 
parameter. Thus if x > 1 and we set R“& °° % = 5 in (72), correspond- 
ing to a reflection loss of 50 per cent, we obtain 
S 22 it no/V x! 
d~1— R(1 — ne®/x’)} 
when 1 — R < 1. By comparing (71) and (74) we can establish the 
relative importance of diffraction effects and reflection losses in deter- 
mining the maximum allowed value of the ratio S/d.* We find that the 


reflection losses are the important factor whenever the reflectivity is 
small enough that 


In 2 (74) 


L—-R>oa7 ee 
Xm No 


(75) 
and diffraction effects dominate when the inequality is reversed. For 
Ne = 1 uw, Am = 3 em (X-band), the quantity 0.17 No/Xm I is of order 5 X 
; Binge even with the best of dielectric multilayers” it is difficult to 
en ee ee 
are the important factor in materials for which «’/no S 100, as is nor- 
mally the case. An important exception is a ferroelectric operated not 
far above its Curie point, for which the ratio «’/np can be very large. 
If we now have the situation where x < 1 and we set exp (—asS 
esc 6;) = 1% in (72), corresponding to an absorption loss of 50 per 
cent, we obtain 





8 <= ine (76) 


with no restriction on d. This equation then sets an upper limit on the 
modulator length when reflection losses are negligible. For materials 
having good optical quality where ag is typically of the order of 0.05 em™’t 
such a case might correspond to having R = 0.999 and d } 0.2 mm. 


5.5 Lffect of Lack of Parallelism in the Reflecting Surfaces 


A lack of parallelism in the mirrors causes the light and modulating 
waves to progressively slip out of synchronism, and so degrades the 
modulator performance. The effect may be estimated as follows. 


* By comparing (71) with (74) we are in effect determining the minimum allowed 
value of the ratio S/d for a fixed sideband amplitude. Such a comparison is of sig- 
nificance since the sidebands carry the modulation information and since their 
amplitude will determine the signal-to-noise ratio at the detector. 

+ Accurate values of the absorption coefficients of highly transparent materials 
are not readily available and are liable to depend on the details of individual sam- 
ple preparation. A value ag = 0.05 cm™ is a conservative estimate (cf. Ref. 18). 
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The time T taken for the optical signal to travel the length of the 


modulator is given by 
8 
We as [ dy (77) 
0 vy (y) 


where v, = (c/no) sin 6(y) is the y component of the optical phase 
velocity vector (Fig. 7). If the two reflecting surfaces are inclined at an 
angle y, the angle of incidence changes by w with each successive bounce. 
When the total number of bounces N is large, we may approximate (7) 
by the continuous function 6(y) = 6; + (NwW/S)y, so that the total 
optical transit time is given by 


8 
T= sin (6 + My) ay 
Cc 40 S 


rv “28 sin n 6; + mo AHS os 84, for Ny «1. (78) 


If the optical and microwave phase velocities are matched at the modu- 
lator input (sin 6; = 70/+/x’), the first term entering in (78) is just the 
microwave transit time. Thus at the output of the modulator the optical 
and microwave signals have shifted, in terms of the microwave phase, 
by an amount 


2 
Nowm S” cos’ 6; 
c 2d sin @; 








Ag = wmAT = 


With the synchronism condition sin 0; = 1/Wx’, this becomes 


- rvs 





(1 — no /«')y, (79) 


\ 
»>MIRRORS 





Fig. 7 — Model used in calculating the effect of nonparallelism of the reflecting 
surfaces. 
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where A,» is the free-space modulating wavelength. This phase shift 
must be less than z if we are to avoid stripping modulation off the beam, 
since the effect of the modulating field will reverse as the phase shift 
exceeds z. This leads to a condition on the parallelism of the mirrors 


Adm 1 3 
p< SiS? (= mele). (80) 


5.6 Effect of Strains 


A complete evaluation of the effect of strain on the modulator per- 
formance is very difficult; hence we shall not attempt a complete treat- 
ment here. Nonetheless, since strains can have a profound effect on the 
operation of the modulator, even an approximate treatment such as 
presented herein can be of value. 

In the present discussion it is convenient to consider two broad classes 
of strain, uniform strain and random strain. The effect of a strain uni- 
form throughout the crystal can be described in terms of a photoelastic 
matrix” which must then be combined with the electro-optic matrix in 
order to completely describe the optical properties of the crystal. The 
net effect is to add a static birefringence to that induced electrically.” 
This problem can in principle be solved analytically, although a large 
strain produces an additional complication since it will in general lower 
the crystal symmetry and hence change the form of the electro-optic 
tensor. Although cases of quasiuniform strain sometimes occur,* we shall 
consider here only the case of random strain, since this imposes a more 
fundamental limitation on device performance. 

In a randomly strained crystal where the strain-induced variations 
occur over distances large compared with an optical wavelength, a ray 
in progressing through the erystal will see a continuously varying index 
ellipsoid. If there is little correlation between the shape and orientation 
of the index ellipsoid at two adjacent points, then a statistical model may 
be used to study the effect of the strains. Such a calculation is now being 
carried out and will be reported separately. We confine ourselves here to 
some general remarks, and a simplified analysis which is valid in the 
limit of very severe strain. 

The general effect of random strain is to partially destroy the spatial 
coherence of a plane wave traversing the medium, without affecting its 


* For example, cubic crystals drawn from a melt are sometimes observed to be 
uniaxial, with the growth axis as the optic axis. 
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temporal coherence.* The wave phase at the output of the modulator 
may then be written as the sum of a coherent and incoherent term, i.e. 


P(r,t) = v(t) + d(x), 


where 6(r) is a random variable, and r is some suitably defined transverse 
position vector. Random strains will also mix the two normal modes of 
the unstrained crystal. We shall not consider the details of this mixing 
here, but simply assume that there exist at the output of the modulator 
two orthogonally polarized modes whose wave phases are given by 


®, (r,t) = g(t) ae 6, (4) (81) 
and 


By (r,t) = oy (t) + 5) (2) (82) 


where y, and yg, are the wave phases in the unstrained crystal as previ- 
ously calculated, and 6,(r) and 6,(r) vary more or less randomly across 
the surface of the beam. In this model the effect of strain on the modula- 
tor performance depends on the autocorrelation functions of 6, and 6, , 
and the cross-correlation between 6, and 6, , as the subsequent analysis 
will make clear. These correlations will depend on the state of strain of 
the crystal, and on the length of the optical path, becoming less as the 
path length increases. 

The electric vectors at the output of the modulator can be written in 
the form 


I 


é, (r,t) = E, exp {jlet + ¢,(t)] + jos(1)} (83) 
and 
ey (r,t) = Ey exp {jlwt + oy (t)] + 761 (r)} (84) 


where HZ’, and EF are the wave amplitudes. From these equations we can 
show that although strains do not affect the basic phase modulation 
process, they do affect the over-all modulator-detector system perform- 
ance. Details depend on the system used, but the following general state- 
ment can be made: Any optical communication system in which the 
modulation or detection process is based upon the heterodyning or homo- 
dyning of two optical signals requires that the two signals have relative 
spatial coherence. Any process that tends to destroy that coherence, such 
as random strain, will degrade the system performance markedly. It is 


* The fact that in general strain will also degrade the collimation of the beam 
is of importance in some applications. See Appendix B.3. 
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therefore necessary to be able to evaluate which system will allow us to 
recover the modulating intelligence with maximum certainty. All modu- 
lation-detection schemes presently under consideration fall into three 
classes 

(zt) amplitude modulators followed by quantum counters, 

(7t) phase or amplitude modulators followed by optical heterodyne or 
homodyne detectors, and 

(zi7) phase modulators followed by optical frequency discriminators 
and then by quantum counters. 
Starting with (83) and (84), we analyze these three classes in Appendix 
B. The results are summarized below. 


5.6.1 Amplitude Modulators followed by Quantum Counters 


As discussed in Section IV, the amplitude modulator consists of a sys- 
tem in which two phase-modulated waves of the form given by (83) and 
(84) are combined in an analyzer. As shown in Appendix B, the time- 
varying intensity transmitted by the analyzer is proportional to an in- 
tegral of the form [see (118)] 


bes {ax(e) - i(0)} da (85) 
the integration being taken over the cross section of the beam. In the 
case of severe random strain, when 6, and 6, can independently take on 
all values (i.e., no cross-correlation between 6, and 6, ), the integral (85) 
vanishes and there is no amplitude modulation. More moderate strain 
would result in a nonzero depth of modulation which would, however, 
be less than that attainable in a strain-free crystal. 


5.6.2 Heterodyne or Homodyne Operation 


In heterodyne or homodyne operation the modulated signal is mixed 
with a second locally generated optical signal to obtain a microwave in- 
termediate frequency which is then detected conventionally. Assuming a 
square-law detector and a strong local oscillator having both spatial and 
temporal coherence, the output signal at the difference frequency is pro- 
portional to [Appendix B, (121)] 


cos 
i] on os 1) da (86) 
which vanishes in the limit of very bad strain where 6, and 6, can take 
on all values. 
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Thus in this mode of operation, too, strains result in a reduced detector 
output. The effect here is likely to be more severe than in the case of the 
simple amplitude modulator-quantum counter system, since for a given 
degree of strain the autocorrelation of 6, or 6, is expected to be less than 
the cross-correlation of 6, and 6, . Thus, in a given crystal the integral 
(85) is liable to have a larger value than the integral (86). 


5.6.3 Phase Modulators Followed by Frequency Discriminators 


In the third system under consideration phase modulation is directly 
converted to amplitude modulation in some frequency-selective optical 
circuit’’’’ of which a Fabry-Perot etalon is perhaps the simplest example. 
Spatial coherence per se plays no essential role here. Analytically, as is 
shown in Appendix B, the integral analogous to the ones given by (85) 
and (86) which arises in this case has the form [see 124] 


[i exp [761 , i(r)]|° da. (87) 


This of course has a constant value equal to the beam area and can 
therefore not influence the system response. However, strain can still 
adversely effect this system to the extent that the discriminator per- 
formance is degraded by loss of collimation in the transmitted beam. 

Thus strain will always be a problem in optical modulators. In fact the 
preceding considerations show that the presence of severe random strain 
will result in no useful output from those modulation-detection schemes 
which rely on the interference of two beams. In this case only the phase 
modulator-frequency discriminator system has any real chance of suc- 
cess. 


VI. OPTIMUM DESIGN PROCEDURE 


6.1 Choice of Modulator Dimensions 


For the purpose of discussing the optimization of modulator dimen- 
sions as well as the choice of operating temperature, a useful figure of 
merit is the product of the modulation sensitivity and the optical trans- 
missivity, which for all modulators can be written as 


nds 4/2. 3a No AmiS 
t= Beas y/ Fella (BC) 
x exp | - Ves (ay — mane 3 - =). 
No d K 





M 
~ |4/P 











(88) 
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Here d is the distance between the reflecting surfaces (either h or w, de- 
pending on whether the 0° mode or the 90° mode is used), and J is the 
width of the reflecting surfaces; recall that S is the modulator length. 
The remaining functions f(a,S/2) and p(m/~/x’) have been defined 
above in (12) and Tables I and II respectively. Equation (88) applies 
when we have velocity matching. 

The optimum modulator dimensions can now be found by maximizing 
the figure of merit. It is evident that the dimension / should be made as 
small as possible. If the beam diameter D, is specified, this then deter- 
mines the minimum value of J, lmin = D». The mirror spacing d may 
next be chosen so as to maximize (88), the appropriate value being 


m) 2\ 4 
don = —28 In RM (1 = ne) (39) 


No 





The ratio S/d obtained from this equation corresponds to a reflection 
loss of roughly 50 per cent. It is worth pointing out that for practically 
attainable reflectivities, this optimum value of S/d is less than the maxi- 
mum value set by diffraction effects [see (77)], and so represents a reason- 
able design value. 

When the optimum value of d given by (89) is substituted back into 
(88), there results an equation for § which depends only on S; S may 
then be chosen so as to maximize this figure of merit. The optimum 
value, Sop: , is defined by the transcendental equation 

t 

ee a wi oo (90) 

EXP AmSopt — 1 — 2amSopt Ny Om 
In the absence of optical absorption (ag = 0), the optimum length is 
given by amSopt = 1.25. As ag increases, the optimum length decreases, 
as shown in Fig. 8. When the inequality (1/~/x’) (am/as) 1 is satis- 
fied, we find that the optimum length is determined solely by the optical 
attenuation, and is given by 


No 
Sopt al IV kag” (91) 





The dimensions of the modulator are thus completely determined: / 
by the beamwidth, d by the optical reflection losses and S by the micro- 
wave and optical attenuation constants. For materials having sufficiently 
low microwave and optical losses, the optimum length given by (90) 
may be greater than any practically realizable crystal length. Under 
these circumstances one should make S as large as possible, and then 
choose d to satisfy (89). 
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Fig. 8 — Dependence of optimum modulator length on optical and microwave 
loss parameters. 
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6.2 Choice of Operating Temperature 


The dielectric and electro-optic constants of useful modulating ma- 
terials are strongly temperature-dependent. In ferroelectrics, for exam- 
ple, both the electro-optic coefficients and the microwave losses increase 
dramatically near the Curie point. We should thus like to establish 
whether there exists a preferred operating temperature which maximizes 
the figure of merit [see (88)] independently of the modulator dimensions. 

We begin by assuming that the modulator dimensions have been op- 
timally chosen according to the results of the preceding section. With 
this choice of dimensions, (88) becomes 


eS 
Qn ‘4/ n dies (S ) no/V x! | 
aera ee /) - = 
Bont Mu Vx'D, In R aE . V'] Lam(l = nex’) 


< [vend 


where U = a@mSopt is found from (90) and where we introduce the nota- 
tion (as/am)(Wx'/no) = Q. The function g(u) = Vue “f(u/2) is, 
through (90), an implicit function of the independent variable Q. The 
function may be approximated as 





(92) 
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g(Q) = 0.214 Hl ba 


For a TEM-like wave, the microwave attenuation constant is given in 
terms of the dielectric constant and loss tangent of the medium by 


Om = = a/x' tan 6. (93) 


The figure of merit can now be approximated solely in terms of material 
parameters as 


1 
4 1 1 2741 n 
= Mo A o 
Seer ie a ast) o( 75) 
pli 


1 wo -H() 
x P 2 \qrno tan 6 
(1 — 1297/x’)! 


where all the numerical constants, including the impedance of free space, 
n = 377 ohms, have been combined into a single factor. 

Once the temperature dependence of the electro-optic coefficient, di- 
electric constant and loss tangent are known,* (94) can be used, at least 
in principle, to select an optimum operating temperature. 

Further analytical progress requires some assumption of the tempera- 
ture dependence of these coefficients as well as a choice of the mode of 
operation. To be definite we select the mode for which p(no/~V/x’) = 
/x'/ny . As may be seen from Tables I and II, this is the most favorable 
value that can be realized and applies, for example, to the linear, 0° phase 
modulator. 

The temperature coefficient of the linear electro-optic coefficient is 
expected to follow that of the dielectric constant.’’ The reason for this is 
as follows: The electro-optically induced birefringence arises funda- 
mentally from a change in the polarization, rather than the applied field, 
and in fact Pockel’s’® original formulation of the electro-optic effect was 
in terms of constants e;; relating the coefficients of the index ellipsoid to 
the induced polarization. The r;; coefficients used here are related to 
Pockel’s e;; by 


(94) 


rij = €o(K’ >= 1)e;;. 


* The index of refraction, no , to the extent to which it is determined by the 
electronic polarizability, should be almost independent of temperature. 
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Thus for materials having a high dielectric constant we expect r,; to be 
approximately proportional to x’, and to the extent that the coefficients 
é;; are temperature independent,” to have the same temperature depend- 
ence as x’. 

A similar argument in the case of the quadratic effect leads to the con- 
clusion that p should be proportional to «”. Here, however, the situation 
is complicated by the fact that the bias field /) , which for optimum op- 
eration should be as large as the breakdown field strength of the material 
permits, will be temperature-dependent whenever this breakdown field 
strength is temperature-dependent. In many solids we expect dielectric 
breakdown to occur at some definite value of internal field. In materials 
of high dielectric constant the internal field is determined by the polari- 
zation, so that the breakdown field strength, and hence the allowed de 
bias, will be inversely proportional to x’. The net result is that the product 
of ply may be taken proportional to the first power of «’, and the quad- 
ratic and linear effects become formally identical. 

The preceding assumptions allow us to write the figure of merit for a 
given material wholly in terms of its de and optical dielectric parameters, 
so that § becomes proportional to 


Ke BA ma \3 
mes 
(1 — n?/x’)# E oe ( 2Qrno tan ;) | (95) 


We must now distinguish between low and high microwave losses, the 
criterion being the magnitude of the ratio 3Amas/2mno tan 6. For suffi- 
ciently low microwave losses, the exponential factor in (95) will dis- 
appear, so that the figure of merit is proportional simply to «’?/2/ 
(1 — 102/x’)1/4. Since the ratio n,2/x’ will always be substantially less 
than unity, the optimum operating temperature is that which maximizes 
x’. For ferroelectric materials this will mean lowering the temperature to 
the vicinity of the Curie point, 7’.. Of course the losses will normally 
increase as the temperature is lowered, so that at some point the approxi- 
mation of low microwave loss breaks down. For sufficiently high loss 
we may expand the exponential term in (95), obtaining a figure of merit 
proportional to 


1 ct 


(1 = ne’/e’)t Vo tan” (96) 


For ferroelectrics such as KDP”™ or SrTiO;"™ in which both «’ and 
tan 6 are dominated by a (7 — T,)~* dependence, 7’, being the Curie 
temperature, this result says that operation near the Curie point is 
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favored (although not strongly) even in the presence of substantial 
microwave loss. 


6.3 Bandwidth Limitations: Design of Optical Coupler 


The question of the bandwidth of the zigzag modulator has two as- 
pects — microwave bandwidth and optical bandwidth. Since both the 
optical and modulating signals propagate as TEM-like waves, it might 
at first be thought that the bandwidth would be limited only by the dis- 
persion in the material parameters of the modulating medium. For most 
materials this would imply a microwave bandwidth as large as 10 or 20 
gc, and since the fractional optical bandwidth required to accommodate 
even microwave frequency sidebands is very small (of order 107°), op- 
tical dispersion should never be a problem. 

A more discriminating analysis of the question of bandwidth reveals, 
however, that the physical dimensions of the modulator limit the realiza- 
ble bandwidth, especially when these dimensions are optimized so as to 
obtain the ultimate in low-power performance. In an optimum design 
the length S.,,and width (or height) dopt are uniquely determined by the 
electrical properties of the modulating medium both at microwave and 
optical frequencies as well as the loss properties of the medium [see (89) 
through (91) and the attendant discussion]. For highly transparent low- 
loss microwave materials, dy», computed from (89) is nominally in the 
order of a few tenths of a millimeter and S,,; is usually several centime- 
ters (see Table IV in Section VII for typical results). The one dimension 
which remains unspecified is the beam diameter D, . As remarked above 
in Section 6.1, D, should be chosen as small as possible and yet remain 
within the diffraction limits of the device over the length Sopt. Ac- 
cordingly, we find that, based upon certain operational practicalities 
and diffraction limits, D, must normally be restricted to about a milli- 
meter. Thus, as the aperture of the device is typically an order of magni- 
tude larger than the spacing between mirrors, it is impossible to satisfy 
(70); 1.e., the beam bounces back out of the modulator, and hence can- 
not be confined to the active area of the device by means of a simple 
aperture arrangement. In such cases the beam can be confined to the 
interior of the modulator only if we include an appropriate optical coupler 
at the input (and output) of the modulator. 

It is shown in Appendix C that a suitable coupler which allows all of 
the incident optical energy to be transferred into the active medium of 
the modulator is a Fabry-Perot resonator of the form shown either in 
Fig. 10(a) or Fig. 11. The physical operation of the coupler is easily un- 
derstood when one observes that because of its finite length energy must 
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leak off to the right into the interior of the modulator. Then by properly 
adjusting the spacing between reflectors as well as the reflectance of the 
lower reflector it is possible to completely cancel the reflected wave, and 
have the energy trapped in the coupler section walk off into the modula- 
tor. Expressions for the required spacing and reflectance are derived in 
Appendix C. Here we simply point out that such a coupler can be realized 
in a typical case with reflectivities in the neighborhood of 99 per cent, 
which is easily attained, and with dimensions L, and |, of the same order 
as Dz and dopt , respectively. 

We are now in a position to examine the bandwidth limitations im- 
posed by the coupler and modulator dimensions. The bandwidth is gen- 
erally determined by three factors. In order of importance these are 

(z) the spacing 7, and loaded Q of the coupler, 
(ii) the size of D, compared with the microwave modulating wave- 
length, and 

(zit) the microwave impedance of the structure. 

The last factor, microwave impedance, is of practical significance be- 
cause the impedance of the modulator will inevitably be low, and it be- 
comes progressively more difficult to obtain a broadband match into a 
transmission line as its impedance is lowered. However, a number of 
standard microwave techniques are available to overcome this difficulty. 

The second factor, the beam diameter, affects the bandwidth in the 
following way. In order that the microwave modulation be detectable 
in a direct simple way it is necessary that the modulation frequency phase 
be approximately constant across a given section of the beam. Hence it 
is necessary that the projection of the beam cross section along the axis 
of the structure be appreciably less than half the microwave wavelength 
in the medium. Mathematically 

Am 
2 cos 6; / x! 
where i, 1s the free-space wavelength. 

Finally, with reference to the first factor, the coupler spacing and 
loaded Q, we note that in order to avoid microwave impedance discon- 
tinuities in the low-impedance modulator line it is mandatory that 7, and 
d be very nearly equal. If the spacing d is then kept small so as to obtain 
the lowest possible modulating power, the input coupler will limit the 
bandwidth in the following way. The Fabry-Perot coupler will be res- 
onant at a series of frequencies spaced by 


Ds K (97) 


Cc 


- Qnole COS O35. (98) 


Af 
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The base bandwidth 6f of the coupler will be determined by the Q of an 
individual mode, and can be shown from Appendix C to be 

1 e¢ sin 8, V1 — ne sin? 6; for no < Vx’ 

- Tv Ds No 





bf (99) 


‘ = 
cos 0; for m > Vl. 


Since this bandwidth is available in each of the Fabry-Perot modes, 
microwave subcarrier operation at frequencies above this baseband is 
allowed” provided that the subcarrier frequency (or optical sideband) is 
some multiple of the mode spacings given by (98). The problem, of 
course, is that the mode spacings become very large when the coupler 
reflector spacing J, is small, and since /, must be comparable with the 
modulator thickness d, it follows that the lowest allowed microwave sub- 
carrier will be at much too high a frequency when d is adjusted to its 
optimum value of a fraction of a millimeter (e.g., d = 0.2 mm cor- 
responds to Af + 300 gc). It is evident then that the minimum power 
configuration is restricted to baseband operation with a bandwidth de- 
termined either by (97) or (99), whichever is smaller. For D, = 1 mm, 
(97) gives fm < 10 ge when the modulating medium is a ferroelectric 
such as strontium titanate, while (99) gives 6f ~ 5 ge. 

Greater bandwidth can be obtained, at the expense of increased modu- 
lation power, by making the thickness d considerably greater than opti- 
mum. If d is made sufficiently large (e.g., 3.6 mm in the case of the stron- 
tium titanate modulator), a simple input aperture can be used in place 
of the Fabry-Perot coupler.* This removes the frequency limitations set 
by (98) and (99), leaving only that imposed by the finite beam width 
(97) and material dispersion. The frequency limitation imposed by the 
beam width is quite fundamental. In fact, when the projection of the 
beam diameter along the axis of the modulator is exactly equal to a full 
microwave wavelength in the material there results no net modulation. 
We therefore define a cutoff frequency for the modulator as 


c 
fe ~ Ds cos 6; VK" yy 


In summary, then, we find that in order to fully exploit the low modu- 
lation power requirements of the zigzag modulator it 1s necessary to re- 
strict the frequency of operation for most substances to a few ge at base- 
band, and to use a relatively sophisticated coupler to get the optical 


* Tf no® > /x it will be necessary to use a more complex aperture analogous 
to the one shown in Fig. 11. 

+ For CuCl the operating frequency can be as high as about 20 ge (see Table IV 
in Section VII). 
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energy into the device. By relaxing the power requirements the upper 
frequency limit can be extended above 10 ge in a relatively simple struc- 
ture. Examples of both kinds of design are given in the following section. 


VII. DISCUSSION AND CONCLUSIONS 


We have analyzed in detail the behavior of zigzag phase and amplitude 
light modulators (Sections III and IV) as well as the basic underlying 
theory for this class of modulators (Section IT). Our treatment differs 
from the work of other investigators! along these lines in a number of 
crucial ways. We have included microwave losses in the initial calculation 
of the optical retardation and have considered the 90° mode of operation 
in addition to the originally proposed 0° mode for several different classes 
of electro-optic media. The analysis has been detailed enough and has 
treated enough of the important practical factors which limit the opera- 
tion of the device (Section V), so that a realistic optimum design pro- 
cedure can be presented (Section VI). In essence, this procedure requires 
that the dimensions of the modulator be so chosen that all of the incident 
microwave power be confined to as small a volume of the electro-optic 
material as the microwave, optical, and reflection losses will permit. As 
a result, large modulating field strengths are obtained with moderate 
amounts of power. 

Based upon our treatment we find that there are three main aspects 
to broadband modulator design. The first is the velocity synchronism 
condition; here we find that the basic phase velocity matching criterion 
is unaffected by microwave loss. The second is the effect of modulator 
dimensions and aperture on bandwidth. These problems have been dis- 
cussed in Section 6.3, where it was shown that the aperture limits the 
highest modulating frequency in a fundamental way [see (100)] and that 
the other dimensions in effect determine the requisite coupling scheme 
(either Fabry-Perot or simple aperture) and thence the bandwidth. The 
final criterion of significance is the strain properties of the modulator 
crystal (Section 5.6). Crystalline strain is a serious problem in all forms 
of electro-optic amplitude modulators, but is particularly deleterious in 
the traveling-wave modulator because of the long optical path. 

We are now ina position to present some actual designs of zigzag phase 
and amplitude light modulators as well as the performance to be ex- 
pected of these devices. Before doing so, however, we first summarize 
in Table III the pertinent microwave and optical materia] parameters, to 
the extent that they are known, for a number of electro-optic modulating 
crystals which are currently available. With regard to the data presented 
in Table III, much of which is tentative, two points should be noted. 
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TasuEe ITI — Mareriau PARAMETERS FOR SoME ELectTRO-Optic CRYSTALS 


Material Te T (p1 — p2)* rat reat no x! tan 6 Remarks 
BaTiO; | 393°K | 393°K 23 X 10°# 2.4 6000 0.5 Electro-optic data apply at Ao = 5021 A 
403°K | 6.5 X 10°” 2.4 4500 0.4 and f S 200 ke (Ref. 24). Microwave 
data apply at 24 ge (Ref. 25). 
SrTiO; 33°K | 77°K | 0.31 X 107”? 2.409} 1500) 1.25 X 107? | Electro-optic data apply at Ax = 6328 A 
293°K |0.194 * 1078 2.409} 300) 1.5 x 10-3 and f < 1 ke. Data obtained from Ref. 


26. Value for mo applies at \ = 5891 A. 
Microwave data apply at 20 gc (Refs. 
21, 27, 28). 


KTaO; | =1°K}] 4°K | 4.3 X 107? 2.5 4400) 0.72 X 10-*| Electro-optic data apply at \ = 6328 A 
77°K | 0.1 X 10° 2.5 800} 0.2 X 107% and f < 1 ke (Ref. 29). Microwave data 
apply for frequencies below 20 gc 
(Ref. 29). 
CuCl 293°K 6 X 101° 1.93 8.3 1 X 10-3 | Electro-optic data measured at de (Ref. 
30). Microwave data measured at 5 ge 
(Ref. 31). 
ZnS 293°K 2.4 X 10719 2.368} 10.25 2 X 10-3 | Electro-optic data measured at de (2.4 X 
1.2 X 10°19 10-1) and 3 ge (1.2 X 107°). Micro- 
wave data measured at 5 ge. 
KDP 120°K | 173°K 1.468 60| 17.5 X 10-3 | Electro-optic data apply at high fre- 
293°K 6.8 X 10779 1.468) 20.2) 7.5 xX 10-3 quencies, f, > 100 ke, and for 4000 < 
Ao < 7000 A (Ref. 32). Microwave data 
apply at 9.2 ge (Ref. 33). 
ADP 148°K | 173°K 1.479 16) 6.5 X 10-3 | 7. refers to antiferroelectric phase transi- 
293°K 5.5 X 10719 1.479 14 6 xX 10-3 tion (Ref. 34). Electro-optic data 


apply at high frequencies, f > 100 ke, 
and for 4000 < Ao < 7000 A (Ref. 32). 
Cama data apply at 9.2 ge (Ref. 


* Expressed in units of cm?/V’, 
{ Expressed in units of em/V. 


LIGHT MODULATORS 2661 


The first is that the microwave properties for the different materials 
listed have not all been measured at the same frequency. For simplicity 
we shall assume that there is negligible dispersion in the microwave 
parameters. The same assumption is made for the optical data. The sec- 
ond point to be noted is that the data available for the electro-optic co- 
efficients apply in most cases at very low frequencies. Hence the values 
for the electro-optic coefficients represent the unclamped values and 
consequently overestimate in many cases the desired microwave or 
clamped values for these coefficients. Thus this table should be viewed 
simply as a guide to the material parameters. From the data given in 
Table III we select three materials, CuCl, SrTiO; and KDP, as being 
the best in their respective crystalline classes. Each of these materials 
combines good microwave properties (low loss with high intrinsic im- 
pedance) with a large electro-optic coefficient. With reference to SrTiO; 
it should be noted that the quadratic effect exhibited by this material is 
anomalously high, as it is with BaTiO; and KTaQ;, the effect being 
10°-10" times higher than in the best glasses. When one couples this to- 
gether with the fact that the “effective linear coefficient” for quadratic 
materials is py , one immediately sees from the data listed in Table III 
that for Hy > 1000 v/cm, quadratic materials are potentially as good as 
if not better than linear materials. 

It is instructive to carry out two designs. One, based on the optimum 
design procedures of Section 6.1, results in a modulator requiring an 
absolute minimum of modulating power. However this design requires 
the use of an input optical coupler and is inherently narrow-band. In 
another, more practical design, the mirror spacing d is chosen to allow 
the use of a simple aperture. In this design we obtain increased band- 
width at the expense of higher modulating power. Tables IV and V sum- 
marize the design parameters, expected performance, and several of the 
practical limitations for each of these designs. Note that we do not in- 
clude an amplitude modulator design for KDP. The reason, of course, 
is the difficulty encountered in trying to compensate for the static re- 
tardation in such a material (See Section 3.3.) 

In order to carry out the calculations several design parameters were 
chosen independently as a matter of convenience. Thus the optical wave- 
length was taken as the visible red line of the He-Ne gas laser at 6328 A, 
and in the absence of experimental data the optical absorption coefficient 
ag was assumed to have the same value as that found in good optical 
glasses, viz., ag & 0.05 cm”. The multilayer reflectivity was taken as 
0.999, about as high as is practically achievable.” A reflectivity this high 
is essential if the ultimate in low-power modulation is to be obtained in 
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the optimum design, but the requirements can be relaxed considerably 
in the practical design. For example, a decrease in reflectivity to 0.982 
increases the optical insertion loss of the practical SrTiO3 modulator by 
only 1.7 db. 

In the absence of any really complete temperature data we have taken 
room temperature, 300°, as an operating temperature. Although it is 
true that for the ferroelectrics, SrTiO; in particular, the temperature 
analysis of Section VI shows that the modulating power is reduced some- 
what as we approach the Curie point, the concomitant increase in x’ 
reduces 6; below the practical minimum and at the same time reduces 
the available bandwidth by decreasing the microwave wavelength in the 
material [see (97) and accompanying discussion]. Thus 300°KX is in fact 
close to the practical optimum operating temperature for SrTiO; . 

The operating frequencies given in Tables [IV and V have been arrived 
at by the following considerations. A practically realizable aperture 
which remains within diffraction limits is nominally 1 mm. The optemum 
design therefore requires the use of a coupler with restricted bandwidth. 
The data of Table IV are thus calculated for a modulating frequency of 
5 ge, which is within the passband of all modulators. In Table V the 
operating frequency has been chosen as 10 ge and is in all cases less than 
the cutoff frequency. What should be noted here is that by ignoring the 
bandwidth problem, we obtain in the optimum design shown in Table IV 
the irreducible minimum microwave modulating power. In the case of 
the practical design listed in Table V the spacing between mirrors d has 
been increased to allow the use of a simple aperture. The price we pay, 
of course, is in increased modulating power. We should also make special 
note of the fact that from a practical viewpoint the impedance level of 
the design given in Table V is much more favorable than for the one 
given in Table IV. The restriction on the maximum attainable modulat- 
ing frequency in the practical design is the aperture size [see (97)]. 

The column headed ‘‘configuration”’ in Tables IV and V deserves some 
comment. For each material considered, the configuration chosen was 
the one for which the function p(no/V«’) had its maximum value. In 
the case of SrTiO; there are three possible configurations for the phase 
modulator. Table I shows that the 0° phase modulator (PM) in the 
-mode and the 90° PM in either the L or ||-modes are all equally 
suitable choices. We should bear in mind at this point that for SrTiO; 
(as well as BaTiO; and KTaQs3), it is only the difference in the electro- 
optic coefficients |p: — p2| that is known (see Table III) and not p, 
or p2 alone. However, since in an amorphous material p2 is identically 
zero (Appendix A) we expect that in a cubic material | p2 | will still be 
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TaBLe IV — Optimum MopvutatTor PERFORMANCE FOR SEVERAL Evectro-Opric Materiats. MopuLatTING 
fe} 
FREQUENCY, 5 Gc; OPTICAL WAVELENGTH, 6328 A; TEMPERATURE, 300°K 


Material 


CuCl 


SrTiO; 


KDP 


Configuration 


0°-PM; 1-mode 


0°-AM 


0°-PM; 1-mode 
90°-AMT 


0°-PM; 1-mode 


|) | | SE | — | | J —_— 


19.1° 


4,84° 


0.072° 
0.072° 


0.15° 


1mm 
1 mm 


1 mm 
1mm 


1mm 


0.19 mm 


3.3 ecm 


18.7 ge 


Remarks 


4.3 db | 0.315 w| Sopt fixed by optical losses. 
4.3 db 


0.13 w 


db 
db |0.17 w 


4.3 db | 2.8 w 


Dimensions are within 
limits set by diffraction. 


0.17 w | Sop fixed by optical losses. 


Dimensions are within 
limits set by diffraction. 
Results apply for Ho = 
30 kv/em. Computa- 
tions for phase modu- 
lator treat material as 
amorphous. 


Sopt fixed by optical losses. 


Dimensions are within 
limits set by diffraction. 


* The insertion loss does not take reflection losses into account. These can be eliminated in any one of a number of standard ways. 
+ In the case of the amplitude modulator, a bias field of 30 kv/em results in operating at a mode number of m = 600. 
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TABLE V — Practical MoDULATOR PERFORMANCE FOR SEVERAL ELEctRo-Optic MATERIALS. MoDULATING 
Frequency, 10 Gc; OpricaAL WAVELENGTH, 6328 A; TEMPERATURE, 300°K 


Material Configuration 6; 


CuCl 0°-PM; 1-mode | 42.1° 
0°-AM 42.1° 


SrTiO; |0°-PM; L-mode| 8° 
90°-AMt 8° 


KDP 0°-PM; 1. -mode| 19.1° 








2.42° 





0.61° 





1mm 





1.53 mm 





3.3 cm 





te 


140 gc 


140 ge 


17.5 ge 
17.5 ge 


70.7 ge 





IL* 


Nw 
or Kor) 
jomgon 
omen 


ae 
lor zor) 
2.2. 


2.6 db 


P(M = 3) 


Remarks 


1.6 w | S fixed by optical losses. 


0.66 w 


Dimensions are within 
limits set by diffraction. 


3.1w |S fixed by optical losses. 


3.1w 


23 w 


Dimensions are within 
limits set by diffraction. 
Results apply for Hy = 30 
kv/em. Computations 
for phase modulator 
treat material as amor- 
phous. 


S fixed by optical losses. 
Dimensions are within 
limits set by diffraction. 


* The insertion loss does not take reflection losses into account. These can be eliminated in any one of a number of standard ways. 
} In the case of the amplitude modulator a bias field of 30 kv/cm results in operating at a mode number m = 600. 
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substantially less than | p; |, so that we may take the measured value of 
| p1. — p2 | aS an approximate measure of | p; | alone. This has been done 
wherever necessary. 

The remaining parameters listed in Tables IV and V require little or 
no further comment. The angle of incidence @; necessary to achieve 
velocity synchronism is given together with the deviation in 6;, | AG; |, 
which results in a 3-db degradation in modulation sensitivity. The angle 
Ymax 1S the maximum permissible angular deviation in the parallelicity 
of the mirrors if the performance is to be degraded by less than 3 db. 
The modulator dimensions are contained in Dz, dopt or d, and Sopt or S 
depending upon which design is being considered. The column headed 
IL in the tables refers to the optical insertion loss suffered in traversing 
the modulator. Finally, the quantity P(JZ = 3) is the microwave modu- 
lating power required to achieve a modulation index of 0.5. 

It is worth mentioning at this point that Tables IV and V implicitly 
contain several idealizations. The first is that the erystals are strain-free. 
For KDP this assumption is approximately justified, but for CuCl and 
SrTiO; it is far from reality and, as noted previously, could seriously de- 
grade the performance of the amplitude modulator. The second idealiza- 
tion comes from the assumption that the modulating field is uniform 
over the entire cross section of the crystal. In any practical modulator 
this will not be strictly true. However, to the extent that fringing fields 
can be neglected in a parallel plate line completely filled with electro- 
optic material, this assumption is realizable (see Section 5.1). In conse- 
quence, the calculations made for the required modulating power should 
be reasonably accurate. The primary faetor which would cause an in- 
crease in modulating power is a deviation from the correct angle of in- 
cidence. However, the tolerances in 6; given in Tables IV and V are 
sufficiently large that maintaining the required angle should present no 
problem. Finally we should point out that we have assumed that a de 
bias as large as 30 kv/cm can be applied to the SrTiO; sample without 
dielectric or interfacial breakdown, and that the electro-optic effect does 
not saturate at these, high fields. It is clear that a high de bias field is 
desirable with the SrTiO; modulator, since this increases the effective 
linear electro-optic coefficient pK . 

The results given 'in Tables [IV and V demonstrate that broadband 
microwave modulation of light can be effected with very little modulat- 
ing power. In a practically realizable design, we find that a l mm X 3.6 
mm X 1.4 cm rectangular slab of SrTiO; biased with 30 kv/em should 
produce 50 per cent linear modulation of light with less than 5 watts of 
power at any modulating frequency from de to 10 ge. A CuCl slab of 
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dimensions 1 mm X 0.75 mm X 6.7 cm should require still less power 

1-2 w) without the need of any de bias. At the expense of reduced 
bandwidth and increased optical complexity, this power can be reduced 
to less than a watt by a correct choice of the modulator dimensions (op- 
timum deisgn). In the zigzag configuration, CuCl appears to be slightly 
better than SrTiO;, in the sense of achieving moderately broadband 
performance with low modulating power. Cuprous chloride has the ad- 
vantage of extreme bandwidth, of not requiring any bias, and of not 
needing reflecting mirrors since 6; exceeds the critical angle for total re- 
flection. Compared with cavity-type modulators using KDP* or ADP*® 
the CuCl zigzag modulator typically requires three orders of magnitude 
less drive power and has up to two orders of magnitude greater band- 
width. Compared with other traveling-wave systems,’ or with those 
iterative structures proposed to date,’ it is substantially superior from 
the viewpoints of size and power. 

As far as the practical realization of these zigzag modulators is con- 
cerned, we believe that there are no insurmountable constructional prob- 
lems. However, it should be noted that at the present time single crystals 
of suitable cubic electro-optic materials of the required optical quality 
and size are not readily available. Continuing materials research, how- 
ever, indicates that this situation should improve. 

We should point out that it appears that high dielectric constant cen- 
trosymmetric paraelectrics are particularly well suited to the zigzag con- 
figuration. This stems from the fact that the dielectric constant of these 
materials is intermediate between those of conventional dielectrics and 
those of ferroelectrics, and their quadratic electro-optic effect is large. 
The dielectric constant is large enough in these materials that the angle 
required by the synchronism condition is reduced to the smallest prac- 
tical value. This results in a long optical path in a crystal of modest 
physical size, and hence in very efficient use of modulatng power. The 
existence of a large quadratic electro-optic effect allows us to obtain a 
very large effective linear coefficient by using a suitable de bias. It is of 
interest to note in this connection that the performance of the SrTiO; 
modulator can be improved somewhat by going to lower temperatures. 

In closing, we observe that the volume of active material required in 
the zigzag modulator can be cut in half by using the double-pass ar- 
rangement shown in Fig. 9. Here both the light and the modulating sig- 
nal are reflected from the end of the modulator, with velocity synchron- 
ism being maintained for both incident and reflected waves. In this way 
the physical length of the crystal required to achieve a given electrical 
length can be halved. 
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Fig. 9 — Double-pass zigzag modulator configuration. 
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APPENDIX A 


In this appendix we derive expressions for the electric field dependence 
of the refractive indices of the two normal modes used in the zigzag 
modulator. 


A.1 Linear Electro-Optic Effect in Cubic Materials of Symmetry Ta 


For this class of materials, to which cuprous chloride belongs, the equa- 
tion of the index ellipsoid relative to the crystalline axes x’,y’,2’ is” 


a (a? by”? + 2) + Qry(Bary'2’ + Eye's’ + Eya'y') = 1 (101) 
0 

where 7 is the index of refraction in the absence of the electric fields 
Ey , Hy ,and [. , and rg is the electro-optic coefficient. For an electric 


field applied along the 2’ axis (0° modulator) in a crystallographic [001] 
direction, the equation reduces to 


a (a? oy”? + 2”) + rk e'y) = 1 (102) 
0 


which is the equation of an ellipsoid having one of its principal axes along 
the 2’ axis, and the remaining two in the 2’y’ plane at 45° to the 2’ and 
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y’ axes.* In a coordinate system x,y,z coincident with these axes the 
index ellipsoid has the normal form 


1 2 1 2 @ 
(4 = raB)a + (4 = raBe)y + ne? = 1. (103) 


The principal indices of refraction and their respective crystallographic 
directions are thus 
2 no 


Nn, = 1+ nrul, along [110] 


(104) 


2 
° ee No 7 

0° case a peer ony 5 nari along [110] 

Ne = along [001]. 

In the 90° modulator, the field is applied in the z-direction. For this 

case one principal axis of the index ellipsoid lies along x’, and the re- 

maining two in the y’z’ plane, at 45° to y’ and 2’. Again, taking these 

principal axes as coordinate axes, we find for the principal indices of 
refraction 


Ns = No along [100] 
2 
° 2 No 
90° case my = [eae along [011] (105) 
2 
fie along [011]. 


1+ noerali, 


A.2 The Quadratic Electro-Optic Effect in Cubic and/or Amorphous Mate- 
rials 


All substances (solids, liquids, and gases) which have a center of 
symmetry exhibit the quadratic Kerr effect. For most amorphous sub- 
stances the electro-optic Kerr effect may be described by the empirical 
relation” 


An = K\oE”. (106) 
In this relation An is the induced birefringence, K is the Kerr constant, 


* It is not essential that the electric field be applied in a [001] direction. In the 
general case, however, a principal axis of the index ellipsoid will not be in the di- 
rection of the applied field, as we assumed in our analysis in Section II. This com- 
plicates both the analysis and operation of the modulator, and results in a 
decreased electro-optic effect. The basic approach used here would nonetheless re- 
main valid, since for cubic materials the orientation of the index ellipsoid depends 
only on the direction of the applied field, and not on its magnitude. Thus, there is 
no ‘‘wobbling”’ of the index ellipsoid, and the two normal modes are not mixed 
in the modulation process. 
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No is the free-space optical wavelength, and EF is the applied field. Thus 
in the presence of an electric field isotropic materials effectively become 
uniaxial and will therefore display the phenomenon of double refraction. 

For crystalline solids the effect is more complicated, and must in 
general be described in terms of a matrix of coefficients. Thus in the 
general equation for the index ellipsoid 


Aye” fs Aooy"” sig A332” = 2ayor’y’ = 2ao3y’2’ + 2an2'x’ = 1 
the coefficients have the field dependence 


1 2 2 : 
ay — a puller + prolly” + pili” + pulley + ply Le (107) 


+ pigliz Lg etc. 


leading to a 6 X 6 matrix of 36 independent electro-optic coefficients 
pi; . As in the linear case, however, crystal symmetry limits the number 
of nonzero coefficients, and in fact for cubic materials having the highest 
symmetry (O,) the quadratic electro-optic effect matrix contains only 
three independent coefficients and is of the form 


p pp pp O O O 
Po Pi p2 0 0 0 
 _ |p om om 9 0 O 
-7 10 0 0 p 0 O 
0 0 0 O pp; O 
Ri 00 0 0 6 


Note that for amorphous materials pp = 0, and p; = p3. 
If the field is now applied along a cube edge, e.g., the 2’ axis of a crys- 
tal of symmetry O, , the equation for the index ellipsoid reduces to 


(108) 


1 

= (2? + y” 4 2) + Ee[oo(a® + y”) + pz] = 1. (109) 
This equation is in normal form as it stands and shows that the field 
induces a change in the refractive index isotropically in the z’y’ plane 
as well as along the field direction or 2’ axis. The principal indices of re- 
fraction determined from (109) are 

2 


re ee No 
Re = ly = Ta: Tne ple along [100], [010] 


0° case (110) 


2 

2 No 
fee, ete i). 
n 1 nde along [001] 


2670 THE BELL SYSTEM TECHNICAL JOURNAL, NOVEMBER 1963 


For a field applied in the x direction (90° modulator) the correspond- 
ing results are 


2 
2 No 
1 + nepili,’ 


90° case (111) 


Nz along [100] 


2 
ier, eee No 
ty = Me = Ta nol along [010], [001]. 


A.3 Linear Electro-Optic Effect in Uniaxial Materials of Symmetry Dea 


For an electric field applied along the c axis of a uniaxial crystal (taken 
to be the 2’ axis, so that we have the 0° case), the equation for the index 
ellipsoid is® 

1 12 12 i} 12 7 nA 

Sa ty) +52 +%rekeacy = 1 (112) 

No Ne 
where no and n, are respectively the ordinary and extraordinary re- 
fractive indices, and 73 is the electro-optic coefficient of interest. As was 
the case for cubic materials, one of the principal axes of this ellipsoid 
lies in the crystallographic [001] direction, the other two in the [110] 
and [110], so that in a coordinate system 2,y,z coincident with these 
axes the principal indices of refraction are 


2 
2 No 


SS oy 11 
Ne RoR along [110] 
2 
° ae No = (113) 
0” case Ny = 1 — nerall. along [110] 
Ne = Ne along [001]. 


For the 90° case the modulating field is still applied along the c axis, 
but this is taken in the x direction so that the principal indices of re- 
fraction are 


Ne = Ne along [100] 
2 no 
90° case a eee along [011] (114) 
2 
Ne ck along [010]. 


~ ie Ne Tesll'z 
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APPENDIX B 


In this appendix we develop some expressions for the output of the 
three modulator-detector systems enumerated in Section 5.6, using (83) 
and (84) to account for the effects of random strain. These equations for 
the wave amplitudes at the output of the modulator were 


e,(r,t) = EF, exp {j[wt + 9, (t)]} exp [76,(r)] (115) 
and 


ey(r,t) = Ey exp {jlwt + ¢)(t)]} exp [76)(1)]. (116) 


B.1 Amplitude Modulators 


When the waves given by (115) and (116) are incident on an analyzer 
whose axes are inclined at an angle 6 to e, , the transmitted wave is 


e(r,t) = e,(r,t) cos 8 + e)(1,t) sin 4. (117) 


The total transmitted power is proportional to 
/ | e:(r,t) |'da, 


the integration extending over the beam cross section, and contains the 
ac terms or information carrying output terms 


P(t) |ac « HEL, cos @ sin 8 cos g(t) | cos d(r) da 
8 
(118) 
— sin ¢(t) i sin 6(r) aa | 
Ss 


where g(t) = ¢,(t) — g(t) and 6(r) = 6,(r) — 6,(r). 


B.2 Heterodyne Detection 


In this system we beat in a square-law mixer a phase-modulated sig- 
nal with a temporally and spatially coherent local oscillator, obtaining 
an output proportional to the product of the local oscillator and signal 
amplitudes. For a local oscillator wave of the form 


éxo(r,t) = Exo exp [j(wxot =F v)] (119) 
the wave amplitude of the output difference frequency is given by 


ea(r,t) « By, |Bxo cos[(w — wro)t —P+o., 1(t) +44 ,4(2)]. (120) 
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This represents the wave amplitude produced at each point on the sur- 
face of the mixer. What is of significance beyond this point is the aver- 
age amplitude (ea(t)), since this determines the response of the IF sys- 
tem.* Averaging over the mixer cross section A» gives 


(ed(t)) = By, \Exo {eos Ce eee ee) 


L 


x 4- 


[cos 6,,\(r) da — sin [(w — wot — p (121) 
+e.Ol ge | sine...) dah 


B.3 Phase Modulator and Optical Discriminator 


To be specific, we consider an elementary discriminator formed by 
operating on the side of the resonance of a Fabry-Perot etalon. If we 
operate at the half-transmission point, the output wave is to first order 
given by 


€o(1,t) T= all = M (inst = w)]e, : (1,0) (122) 


where M is the slope of the transmission curve at w, and inst is the 
instantaneous frequency 


0 
Winst = Ey, [wt + Yu, y(t) + by, )(r)] 
‘ «) (123) 
= OP as WO) 
w + Al ; 
The total transmitted power is proportional to 


Jl eotrs) faa, 


and for small frequency deviations the output power is therefore given 
by 


P(t) o E + M fea | E,, if | exp [75,, i(r)] ? Ae. (124) 


* Regardless of the form of detector (photodiode,®*“° photoconductor* or 
photoemitter*?3) the optical mixing occurs over a finite area large compared with 
an optical wavelength, but small compared to the different frequency wave- 
length. Thus we consider that the total difference frequency output of the mixer 
is a sum of the independent contributions from each elemental portion of the ac- 
tive area, i.e., it is proportional to the value of eg averaged over the photo surface. 


LIGHT MODULATORS 2673 


It should be borne in mind that (124) is strictly valid only when the 
discriminator is illuminated with perfectly collimated light. Strain has 
the additional complicating effect of producing a spread in the beam in- 
cident on the discriminator. Such a distribution of incident angles is 
equivalent to a broadening of the discriminator frequency response, and 
hence reduces the FM-to-AM conversion efficiency. 


APPENDIX C 


The coupling structure analyzed in this appendix is the Fabry-Perot 
resonator shown schematically in Fig. 10(a). The dimensions of the 
resonator L, (length) and 7, (spacing) are both assumed to be much 
larger than the optical wavelength. The structure can therefore be 
analyzed in terms of plane waves bouncing back and forth between 
plates. Since the length of the coupler is finite, we assume that energy 
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ul be MODULATING MEDIUM, 
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Fig. 10 — Fabry-Perot coupler for optically matching to the modulator when 
the aperture exceeds the reflector spacing: (a) geometrical arrangement; (b) equiv- 
alent circuit. 
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leaks off to the right in the y direction with group velocity v, = (c/n) 
sin 6;. Neither diffraction losses nor losses within the electro-optic me- 
dium are included in this treatment. 

We take as a model for the Fabry-Perot resonator a resonant trans- 
mission line system with equivalent circuit shown in Fig. 10(b). The 
resonator is represented by a transmission line which is an integral num- 
ber of half wavelengths long in the transverse or x direction. The “walk- 
off” losses are represented by assigning a small resistance, Z, = rZz , to 
the terminating short circuit. Losses can be introduced in a variety of 
ways, but for high-Q resonators the results are independent of the way 
the losses are introduced; for the present treatment the representation 
chosen is particularly convenient. The finite reflectivity dielectric multi- 
layers are represented by an ideal transformer. Strictly speaking, the 
transformer turns ratio n will depend upon the angle of incidence and 
details of the multilayer construction. Experimentally, n is deduced by 
measuring the reflectivity of the multilayers when the resonator is 
terminated so as to eliminate reflections from the back surface (1.e., 
the perfect reflector). Finally, the characteristic impedances Z, and Ze» 
refer, respectively, to the wave impedances of free space and the electro- 
optic material in the x direction. They depend upon the index of re- 
fraction, angle of incidence, and direction of polarization of the optical 
signal. In order to illustrate the design principles for such a coupling 
system, we note that for waves polarized perpendicular to the plane of 
incidence in Fig. 10(a), Z; and Z are given by’ 


2 = n Sec 6, (125) 


and 
Zo =” sec 6; (126) 
No 


where 7» = 377 ohms is the intrinsic impedance of free space and 7p is 
the refractive index of the modulator material. 

The objective of the analysis which is to follow is basically to calcu- 
late the dimensions of the Fabry-Perot coupler and the required re- 
flectivity for 100 per cent transmission of light, in terms of known pa- 
rameters. We begin by observing that for a given aperture D, the coupler 
length ZL, is given either by 


L. = Ds(1 — no sin? 6;)? (127) 
where use has been made of Snell’s law in relating 6; to @,’, or by 
L, = D,/cos 6;. (128) 
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Two equations are required to specify Z,, since it is possible for the 
angle of incidence required to achieve velocity synchronism to exceed 
the maximum achievable incident angle 6; determined by the limit 
angle relation 


sin 0; = 1/no e 


It is found in fact that in order to achieve velocity matching the ex- 
ternal angle of incidence must satisfy the relation 


No 


and for the case np > Wx’, (e.g., CuCl) we cannot employ the arrange- 
ment of Fig. 10(a) for which (127) applies. In that case it is necessary 
to use a more complex coupler such as the one given in Fig. 11 and use 
(128) in place of (127) to calculate L, . The spacing /, can next be found 
from the resonance condition 8,1, = mz, which yields 


Mo 


c= Dn Cos 6 ) for m= 0, 1, 2, = iss (129) 
0 a 


The calculation of the requisite reflectivity for 100 per cent light trans- 
mission into the modulator can be found in the following manner. We 
can couple all of the incident energy into the modulator by choosing 
the turns ratio n (i.e., the reflectivity of the multilayers) to obtain 
critical coupling, in which case the loaded Q, Q: , becomes half of the 
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Fig. 11 — Modified coupling arrangement used when no?/1/*’ > 1. 
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unloaded Q, Qo, of the resonator. Since the resonator is an integral 
number of half wavelengths long, the condition for critical coupling is 
equivalent to choosing n to satisfy the impedance matching relation 


2 
n =r Zr. (130) 
Disregarding for the moment the determination of 7, which, as will be 
shown subsequently, can be expressed in terms of the unloaded resona- 
tor Q, we find that the multilayer reflectivity, or square of the input 
reflection coefficient when transmission line Z: is terminated [see Fig. 
10(b)], must be chosen so that 


Re (} = ay (131) 


Equation (130) has been used in obtaining this result. 

There only remains now the problem of determining the dimensionless 
constant r. The unloaded Q of the resonator can be found by observing 
that energy must leak off in the y-direction [Fig. 10(a)] with group 
velocity v, = (c/no) sin 6; . The power flow to the right then is 


Ws 
L, 





p= 





Ug 


where W,/L, is the optical energy stored per unit length in the Fabry- 
Perot, and so the resonator Q(Qo = wW,/P) is simply 


Qo — econ ese 6;. (132) 

Xo 
On the other hand, for the equivalent circuit of Fig. 10(b) the unloaded 
Q is given by 


Qo = ma /2r. (133) 


The constant r can now be found by equating (132) and (133). Then 
by using (127) or (128) and (129) to eliminate Z, and the integer m, 
we finally arrive at the formula 


1 1 V1 — No sin? 6; for No < VK! 
la D, sin 6; cos 6; (134) 
a cos 6; for no > Vx’. 


It should be noted that the final results drawn from this analysis are 
valid only when the light is polarized normal to the plane of incidence. 
The method of analysis outlined applies quite generally, however, and 
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can be used for any plane of polarization provided that the character- 
istic impedances Z, and Z, can be determined. For design purposes the 
expressions of particular significance are (127), (128) and (129), which 
fix the dimensions of the coupler, and (131) and (134), which give the 
necessary reflectivity for transferring 100 per cent of the light into the 
modulator (neglecting, of course, diffraction losses). 

One final comment is in order concerning the output coupler. It is 
evident almost by inspection that the output coupler could be of pre- 
cisely the same form as the input coupler but would transmit only half 
of the light. As an alternative, we suggest using a scheme whereby the 
beam is simply allowed to diverge at the output of the modulator. If the 
beam divergence as determined by (49) is not excessive, then in principle 
all of the light could be collected by a lens, thus making the output 
coupler 100 per cent efficient. 
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Systematic Jitter in a Chain of 
Digital Regenerators 


By C. J. BYRNE,* B. J. KARAFIN and D. B. ROBINSON, JR. 
(Manuscript received May 29, 1963) 


Digital pattern variations are a major source of timing jitier in self- 
tamed digital regenerators. In a chain of similar regenerators, the same jitter 
is introduced at each one. Therefore, the jitter accumulates systematically. 

A model is introduced to predict the accumulation of jitter in a chain of 
identical regenerators with single tuned timing filters. Simple expressions 
have been derived from the model for the response of the timing phase to sud- 
den pattern transitions and to random patterns. These results show that the 
guter grows without bound in a long chain, but its form is such that the tend- 
ency to cause errors will not increase as the chain becomes longer. 

Measurements on a chain of 84 regenerators in a field installation are in 
excellent agreement with the theory. 


I. INTRODUCTION 


An ideal digital regenerator must restore the shape and timing of a re- 
ceived signal. A self-timed regenerator extracts the timing information 
from the received pulse train by means of a narrow bandpass filter 
tuned to the repetition rate. Imperfections in the regenerator and noise 
in the channel disturb the timing information and cause the phase of 
the pulse train to vary. This variation is called jitter. 

Jitter in self-timed repeaters has been studied for some time. Sunde! 
and DeLange? concluded that jitter due to random noise would not be 
serious. Attention then turned to jitter introduced in the timing tanks by 
pattern variations. Bennett? studied a single mistuned regenerator. Rowe? 
analyzed a long chain of tuned regenerators. 8. O. Rice studied jitter due 
to a random pattern in a long chain of systematically detuned regenera- 
tors. 

The first fully developed digital regenerator intended for use in long 
chains became available in 1961. This was the repeater for the Tl PCM 
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Carrier System. Investigations of jitter in this repeater by R. C. Chap- 
man, Jr. and later by the present authors clearly indicated that the pre- 
dominant jitter in chains was caused by systematic sources related to the 
pattern. No one cause of jitter was predominant, but rather several con- 
tributed significantly. Mistuning did not appear to be a major contribu- 
tor. 

As a result of these observations, it was felt that a simple, approxi- 
mate theory was needed instead of the more complex, precise attack 
which had been used with mistuning. Should each of the several sources 
of jitter require its own precise analysis, the complete analysis, with all 
sources, might be extremely cumbersome. Therefore, Chapman proposed 
a simple model which depends on measurement to give the phase shift 
generated in each repeater. He used the model to predict how jitter would 
build up in a long chain of repeaters for a transition from one repetitive 
pattern to another. In this paper we will adopt Chapman’s model and 
derive from it the behavior of a long chain of repeaters for arbitrary in- 
put patterns. The theory derives the dynamic behavior of a long chain 
of repeaters from static measurements of a short chain of repeaters using 
simple repetitive patterns. 

The theory has been tested by measurements on a chain of 84 repeaters, 
installed in the field. Agreement with the theory is very good, and is 
best for the repeaters near the end of the chain. There is no theoretical 
reason or experimental evidence to suspect that the theory is not valid 
for much longer chains. 


Tl. A MODEL OF JITTER ACCUMULATION 


In a self-timed forward acting regenerator® (see Fig. 1) a timing wave 
is extracted from the received digital signal by a narrow bandpass filter. 
This timing wave will be phase modulated by whatever jitter has ac- 
cumulated in the signal. The filter will smooth or reduce the amount of 
the phase modulation. 
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TIMING 
FILTER 






SHAPER 


Fig. 1— Block diagram of a self-timed forward acting regenerator. 
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The filtered timing wave is then used to retime the digital pattern. 
With ideal retiming a sharp spike is formed. This spike determines the 
time at which a decision is made by the amplitude regenerator and thus 
determines the leading edge of each pulse. Therefore, the digital pattern 
carries the phase of the timing wave as pulse position information. Some 
high-frequency, fine structure is lost, since only digital marks carry tim- 
ing information, but the high-frequency jitter is not significant in long 
chains. 

In the process of extracting, reshaping, and using the timing wave, the 
regenerators distort the timing and inject jitter. This jitter adds to that in 
the incoming signal and is processed with it in all regenerators further 
down the chain. The total jitter tends to increase as the chain grows 
longer. Our purpose is to describe the manner in which this jitter ac- 
cumulates. 

The model originally proposed by Chapman is shown in [ig. 2. The 
digital signal is not represented, since it acts only as a carrier for the 
timing wave for present purposes. The delays between regenerators are 
also eliminated, since they do not affect the manner of jitter accumula- 
tion. 

A number of assumptions have been made to make the model tracta- 
ble. These are: 

1. The same jitter is injected at each repeater. This assumption pre- 
cludes causes such as random noise and neglects variations in regenerator 
parameters. This assumption seems reasonable, since systematic jitter 
accumulates more rapidly than the nonsystematic type, and therefore 
the jitter at the end of a sufficiently long chain will be predominantly due 
to systematic causes. The most likely sources of systematic jitter are 
those which are related to the digital pattern itself, which is the same 
at each regenerator. 

2. All significant jitter sources can be represented by an equivalent 
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Fig. 2— Block diagram of Chapman’s model for jitter accumulation. 
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jitter at the input of each timing tank. This apparently arbitrary assump- 
tion will be discussed in the section on jitter sources. 

3. Jitter adds linearly from repeater to repeater. 

4. The timing tank is a simple single-tuned circuit, tuned to the pulse 
repetition frequency. 

5. If the rate of change of phase jitter is smal], then, as shown in Ap- 
pendix A, as far as phase modulation is concerned, the timing tank is 
equivalent to a single-pole low-pass filter. The pole corresponds to the 
half bandwidth of the tuned circuit. 

Therefore, 


l 


ies 7/2) 


(1) 


where 


6; is the input jitter 
6, is the output jitter 
B = w,/2Q is the half bandwidth of the tank. 


Inspection of (1) shows that very low-frequency jitter is unchanged 
by the timing filter, while higher-frequency jitter is attenuated and 
phase shifted. 

The jitter introduced in the first repeater will be modified by the 
jitter transfer function of the timing filter and passed on to the second 
repeater. Here the same jitter will be introduced again and added to the 
jitter received from the first. The sum will be operated on by the timing 
tank of the second repeater and the result passed on to the third. This 
sequence will be repeated to the end of the chain, accumulating jitter 
all the way. 

The jitter at the end of a chain of N repeaters will be equal to the sum 
of the jitter introduced in the last repeater operated on by one tank, the 
jitter introduced in the next-to-last repeater operated on by two tanks 
in cascade, and so on back to the first repeater. Since the jitter wave- 
forms introduced in each repeater are assumed to be identical, we can 
express the sum as 


N 


nts) = & 060) (>> A) (2) 


where O(s) is the transform of the pattern jitter introduced at each 
repeater. 
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The right-hand side of (2) is the sum of a geometric series and can 
easily be shown to be 


owe) = of) 2{1- (p+) |. (3) 


Tquation (3) has an interesting approximation. If we neglect short-time 
(high-frequency ) phenomena, we can restrict our attention to the region 
where the magnitude of s is much less than B. Then for large N, 


iL ig Ss 
( x cami) ~exp(—¥ 7): 
Substituting this into (3) we have 
B N 
Ov(s) Y O(s) _ E — exp (-4 °) | 5 (5) 


In the time domain, (5) becomes 


mw) Bl | oa o(e-%) ar] ©) 


dy(t) & B : “gm Met (7) 


Random jitter is more suitably represented by its power density than 
its Laplace transform. The term “power” is misleading here, because the 
square of an angle has no relation to power. However, we can define 
angular power density in direct analogy to a voltage power density: the 
angular power density at a frequency f is the mean squared value of the 
spectral components of the angle in a band one cycle per second wide 
centered at f. 

The power density of jitter at the end of a chain of N repeaters can 
be found by replacing s by jw in (3) and finding the square of the mag- 
nitude of the transfer function; 


oe (2) a (reyes) 


where & is the power density of the jitter injected in each repeater. 
A simpler approximation to (8) can be derived from (5) 


E N fF 
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The exact transfer function given in (8) has been computed and is 
plotted in Fig. 3 for a range of values of N (the number of repeaters in 
the chain). At very low frequencies the transfer function is flat, and the 
power gain is proportional to the square of N. For higher frequencies, 
the power gain falls off as the inverse square of frequency. As a result of 
this spectrum shaping, the bulk of the jitter gain for large N is at low 
frequency. The strong ripples are interesting, but of little practical 
significance. 

We will return to a study of the properties of (8) and (8), but first we 
will study the sources of jitter in each repeater. 


III. SOURCES OF JITTER 


Systematic jitter must be caused by a disturbance which arrives at 
successive regenerators in step with the pattern. The most likely cause 
of such a disturbance is variation in the pattern itself. In an ideal re- 
generator the pattern would have no effect on the phase of the timing 
wave. Unfortunately, in actual repeaters there are many imperfections 
which produce phase variations correlated with the pattern. 

Causes of pattern jitter include intersymbol interference, finite pulse 
width effects, amplitude-to-phase conversion, and mistuning of the tim- 
ing filter. These sources are extremely difficult to analyze in detail; fur- 
thermore, a careful analysis of one regenerator design would have to be 
completely redone for another. 

As we shall show, the complexity of these jitter sources occurs prin- 
cipally in the detailed high-frequency fine structure of the jitter. The 
low-frequency behavior of jitter (averaged over many pulses) is much 
more tractable. In fact, the low-frequency jitter due to an arbitrary pulse 
pattern can be predicted from measurements with a small set of test 
signals. This permits us to completely ignore the detailed analysis of 
the jitter sources, using very simple measurements of a few sample re- 
generators instead. 

The methods that will be used in this section lack rigor and the as- 
sumptions are not fully justified. This situation cannot be remedied 
without doing just what we wish to avoid: performing exhaustive anal- 
ysis of a particular repeater design. Therefore, we proceed without rigor 
and rely on the experimental evidence to validate the results. 

The first step is to show that the major sources of jitter can be repre- 
sented by an equivalent jitter introduced before the timing filter, and 
that this jitter at a particular instant depends only on the last few bits 
of the pattern. In other words, there are no long memory mechanisms 
in the equivalent jitter sources. 
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Fig. 3 — Jitter spectrum due to a random pattern (computed from the model). 
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The first source to be considered is intersymbol interference. Imperfect 
equalization of the transmission transfer function results in smearing 
each pulse so that adjacent pulses overlap to some extent. When clipping 
is used to reduce noise, the effective center of a pulse depends on whether 
or not pulses are adjacent to it. This effect occurs in front of the timing 
filter and the interaction of the pulses is effective over only a few bit 
intervals. Therefore, interpulse interference has the properties we as- 
sumed for jitter sources. 

Finite pulse width effectst and amplitude-to-phase conversion are 
similar in that they both depend upon the amplitude of the timing wave 
in the tank circuit. This amplitude varies with the density of the pat- 
tern. Finite pulse width effects are due to the disturbance of the timing 
wave when a pulse is actually present. The amount of the phase dis- 
turbance depends upon the amplitude of the timing wave. Amplitude- 
to-phase conversion occurs in detecting zero crossings of the timing wave 
in order to form spikes for ideal retiming. If the threshold is not exactly 
at the zero level of the timing wave, a phase shift will be introduced 
which depends upon the timing wave amplitude. 

At first sight, the jitter due to the timing wave amplitude variations 
appears to contradict the assumptions about jitter sources, since the 
variations are in a signal which has been operated on by the timing 
filter. However, we can show that these effects are equivalent to phase 
shifts before the filter. 

The digital bit pattern can be regarded as a high-frequency amplitude 
modulation of a carrier at the bit repetition frequency. This signal, which 
is also phase modulated by jitter, is passed through the timing filter. 
It is shown in Appendix A that the output of the timing filter (the timing 
wave) will be amplitude and phase modulated, and if A;(s) is the trans- 
form of the input amplitude modulating signal, and if this signal is 
small, then 


A,(s) = A;(s) (10) 


1 
1 + (s/B) 


where A,(s) is the transform of the timing wave amplitude modulating 
signal and B is the half bandwidth of the timing filter. 

If there is an amplitude-to-phase conversion mechanism with a 
coefficient C, the timing wave jitter will be 


oe eee 
Vb (s/B)" 


But it is also shown in Appendix A that a phase modulation 0(s) be- 


0,(s) = CA,(s) a CA,(s) (11) 
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fore the filter will cause a timing wave modulation given by 


8.(s) = O:(s) (12) 


1 
1+ (s/B) ° 
By comparing (11) with (12) it can be seen that a jitter source operat- 
ing on the amplitude modulation of the timing wave after the filter can 
be replaced by an equivalent jitter source operating on the amplitude 
modulation of the bit pattern before the filter. The equivalent jitter 
source is given by 


0,(s) = CA,(s). (13) 


Note that there are no long-term repeater memory effects in this jitter. 
Thus the timing wave amplitude sources have the properties we assumed. 

The remaining source of jitter is mistuning of the timing filter. No 
simple argument can be presented to show that a proper equivalent 
exists for this. However, 8. O. Rice has made a careful study of the 
systematic mistuning jitter. He has found that such jitter is equal to the 
sum of two terms: a high-frequency term which is predominant for short 
chains, and a low-frequency term which is predominant for long chains. 
The latter term is identical to the result we obtain using our model. 
Therefore, although mistuning does not directly fit our model, the re- 
sults for this special source agree with those predicted by our model. In 
the case of the particular regenerators used in the experiments reported 
below, mistuning did not appear to be a major source of Jitter. 

As we have said, the above demonstration is not absolutely rigorous, 
and in the end we depend on the experimental evidence to justify our 
assumptions. This demonstration shows, however, that the model is not 
without some theoretical justification. So now, considering that all the 
important jitter sources can be represented by equivalent sources in front 
of the timing filter and that there are no long-term repeater memory 
mechanisms associated with these equivalent sources, we proceed to show 
how the jitter caused by an arbitrary bit pattern can be predicted from 
measurements with a set of test signals. 

A sample from a bit sequence is shown in Fig. 4(a). Consider this as a 
sequence of blocks of 8 bits each. The factors involved in choosing a 
block length of 8 will be discussed later. The jitter introduced during the 
time span of a particular block will be chiefly caused by the bits within 
the block and only slightly influenced by bits outside of the block. This 
is a consequence of the assumption that there are no long memory 
mechanisms involved. 

Therefore, it is reasonable to assign a particular value to the phase shift 
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Fig. 4— (a) Sample from a bit sequence; (b) actual and approximate jitter 
waveforms. 


of each 8-bit block. This value is the average value of the actual phase 
shift over the interval. The resultant approximate jitter waveform is 
shown in Tig. 4(b). 

The average value of phase shift for a particular 8-bit block may be 
determined by repeating the block over and over again with a suitable 
test signal generator and measuring the de phase shift that results. By 
this means, an average phase shift 8; can be measured for each possible 
pattern 7. There are 36 patterns 8 bits long which are distinct when 
repeated. These patterns are shown in Table I. 

The block length used for measurement must be a compromise; it 
should be as long as possible to minimize end effects, but if it is too long 
there will be so many distinct patterns that the measurements would be 
impractical. The length 8 was chosen as a reasonable value. 

We are now able to represent the jitter 6(t) injected due to an arbitrary 
pattern in terms of measurements with a few test signals. The resultant 
representation is not valid for high frequencies because of the artificial 
structure of the 8-bit blocks. However, it should be approximately 
valid for the low-frequency content of the jitter. Since it has been shown 
that low-frequency jitter accumulates much more strongly than high- 
frequency jitter (see Fig. 3), we are mainly interested in the low-fre- 
quency jitter. The agreement of this model with the actual low-frequency 
jitter will be only approximate because overlap and end effects of the 
blocks have been neglected. 

In the next two sections this model will be applied to special types of 
patterns to predict the nature of jitter produced by them. 


IV. REPETITIVE PATTERN TRANSITIONS 


The worst possible jitter occurs when a repetitive pattern which 
causes an extreme phase lag is suddenly changed to a pattern which 
causes an extreme phase lead or vice versa. Because this is the worst 
case of jitter it is worth careful study. 
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TasBLE I — Static PHASE SuHiFrts OF REPETITIVE 8-Bit PATTERNS 


: 9j : : 
1 
Pattern ‘Nuiibér Pattern peo by oe re 
(Degrees) 
I 10000000 —0.7 8 1 
2 11000000 +5.9 8 1 
3 10100000 +1.3 8 2 
4 10010000 —1.8 8 2 
5 10001000 —1.8 4 1 
6 11100000 +2.0 8 1 
7 11010000 +2.2 8 2 
8 11001000 —2.4 8 2 
9 11000100 —3.1 8 2 
10 11000010 —0.2 8 2 
11 10101000 +1.8 8 3 
12 10100100 —1.3 8 3 
13 11110000 +6.8 8 1 
14 11101000 +0.7 8 2 
15 11100100 —1.8 8 2 
16 11100010 +0.9 8 2 
17 11011000 +3.1 8 2 
18 11010100 —0.4 8 3 
19 11010010 —1.5 8 3 
20 11001100 +0.9 4 1 
21 11001010 —0.9 8 3 
22 10101010 0 (refer- 2 1 
ence) 

23 11111000 +2.8 8 1 
24 11110100 +1.5 8 2 
25 11110010 +1.8 8 2 
26 11101100 —1.1 8 2 
27 11101010 +2.4 8 3 
28 11100110 —0.2 8 2 
29 11011010 +0.9 8 3 
30 11111100 +5.9 8 1 
31 11111010 +3.7 8 2 
32 11110110 +3.9 8 2 
33 11101110 +2.4 4 1 
34 11111110 +5.0 8 1 
35 11111111 +11.2 1 0 
1ORRRRRR +0.2 sum 255 sum 64 


(test random) 





Note: p; = weight/sum of the weights. 
An example of a sudden transition from pattern 2 to pattern 7 is shown 
in Fig. 5. The jitter @ which is injected into cach regenerator suddenly 


jumps from 6; to 6;. The jitter at the end of a chain of N repeaters can 
be found by applying (7) 


by(t) & B | “apm Dat (14) 


where 


2690 THE BELL SYSTEM TECHNICAL JOURNAL, NOVEMBER 1963 


» NII _n_n_n-— 


PATTERN NO. 22 PATTERN NO. 2 
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to 


Fig. 5 — Sudden pattern transition: (a) bit pattern, (b) injected jitter, and 
(c) resultant jitter. 


6(t) = 0;, t > 0. 


Carrying out the integration, we have 


N6:, t<0O 
7 es “i N 
On(t) NO; + BO; — 6i)t, O<t< 7 (15) 
. N 
No; i> B: 


Equation (15) is plotted in Fig. 5(c). 

The approximation used in deriving (15) is valid for low frequencies, 
and therefore (15) will be a good approximation when N is large. 

Note that the total change of phase at the end of N regenerators will 
be N(6; — 6;). This change is unbounded as the length of the chain 
increases. 

The rate of change of the phase during the transition is 


d a: ra 
sont) = BO; — 8). (16) 
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This rate of change of phase, which acts like a frequency change, is 
limited and independent of N. Further, since (6; — 6;) will be a fraction 
of a radian, the frequency shift will be well within the bandwidth of the 
timing filter (B is the half bandwidth). Note that (16) can be derived 
directly from (3) using the initial-value theorem. 

The worst case of jitter, for both phase and frequency deviations, is 
characterizcd by the maximum value of (6; — 9;) for all possible pairs of 
repetitive patterns. This value can therefore be used as a figure of merit 
for the regenerator. The smaller this figure is, the less will be the maxi- 
mum phase excursion for a given length of chain, and the less will be 
the maximum frequency deviation. 

It may seem unlikely that such a transition will take place on a real 
communication channel because simple repetitive patterns are rare. 
However, they can occur in some systems, particularly when a line is 
idle. Another example is a PCM system being used to transmit slowly 
changing analog data. 


V. RANDOM PATTERNS 


In normal operation a digital channel can be expected to have an es- 
sentially random bit pattern. We can find the jitter injected by such a 
signal by breaking it up into successive 8-bit blocks. The patterns in 
the blocks are assumed to be independently selected from the set of 8-bit 
patterns listed in Table I. The patterns do not occur with equal prob- 
ability. For example, the pattern 10000000 can also occur as 01000000, 
00100000, etc., a total of eight distinct forms. However, the pattern 
10101010 can occur in only two forms, the other being 01010101. The 
probability p; of each pattern is shown in Table I as a relative weight. 

From this model of the jitter 6(¢) injected by a random pattern, we 
can derive the jitter power density ®. Since the model is valid only for 
low frequencies, we will ignore the high-frequency structure of ®. 

The spectrum of the model is derived in Appendix B. It is shown that 
the low-frequency power density is flat (independent of frequency) and 
that the value of the density is 


6 = 270° (17) 
where 7’, is the time duration of each block. 


o is the standard deviation of the block phase shifts 


] pi ] aN 
= Wi 2X D; (4 — iF > pi) (18) 
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where JM is the total number of distinct patterns and p; is the proba- 
bility of pattern 7. 

The value of © from (17) can be substituted into (8) to give the jitter 
at the end of a chain of N repeaters 


«= ())'- Gem) 


Because ® is independent of frequency, the plot of jitter in Fig. 3 can 
also be taken as a plot of the jitter spectrum. The 0 db line represents 
®, the jitter injected in each regenerator. 

A number of conclusions can be drawn from the spectra plotted in 
Fig. 3. The jitter at a given frequency is bounded for all N but the 
bound grows without limit as the frequency approaches zero. The low- 
frequency asymptote grows with an amplitude proportional to N. 

For very long chains of regenerators, the jitter spectrum approaches 


2 


OT sa" (19) 








RB 2 
lim @y > (2) ®. (20) 
N70 (2) 

The mean-square jitter can be found by integrating the jitter spectrum 
over all frequencies (see Appendix C). The result is 


Oy? = BP(N) (21) 


where 
P= QT 30° 


B is the half bandwidth of the timing tank, and 
Py is given in Table II for values of N from 1 to 100. 


For chains of more than 100 repeaters, 
Oy? SY YORBN. (22) 

Therefore, we see that the mean square jitter in long chain increases 
as N and the root mean square jitter increases as the square root of N. 
Rowe’ and DeLange” have shown that root mean square jitter due to 
nonsystematic sources increases only as the fourth root of N. Therefore, 
systematic jitter must dominate in sufficiently long chains. 

It is interesting that the total rms jitter is unbounded, even though 
the jitter components near any one frequency are bounded. This be- 
havior has been confirmed by Kinariwala’ for a very general model of a 
regenerator. The explanation of the paradox lies in the fact that the 
bound for the very low-frequency components becomes very large as the 
chain grows longer. 
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TasLe IT — 9,2 = ®BP(N) 


N P(N) 
1 0.250 
2 0.625 
3 1.031 
4 1.453 
5 1.887 
6 2.308 
7 2.753 
8 3.200 
9 3.652 

10 4.105 

20 8.74 

30 13.27 

40 18.22 

50 23.01 

60 27.82 

70 32.64 

80 37.48 

90 42.32 

100 47.18 


For N > 100, P(N) = N/2. 


Another interesting property of (21) is that the jitter power is propor- 
tional to the timing filter bandwidth. This confirms the intuitively ob- 
vious fact that high-Q tank circuits in the regenerators will reduce jitter. 

The amplitude distribution of jitter is also interesting. In principle, it 
is truncated, since the amplitude is limited by the worst case bounds 
discussed in the previous section. However, the general shape of the 
distribution will be shown to be Gaussian, and for long chains of re- 
peaters the truncation occurs far out on the tail of the distribution. 

To study the amplitude distribution, we will use (7) 


by(t) & B I “em MD tt (23) 


The approximation is valid for low frequency and therefore is good for 
long chains, where low frequencies dominate. When we consider @(t) to 
be a sequence of rectangular blocks of duration 7’, we can write 


Pp 
Ov(pT'n) & BT's >, 6. (24) 
p—M 


Where 6y(p7'z) is the jitter at time p7', 


T3B wk 


K is the number of bits in each block of duration 7’; . 


(nearest integer). 
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Equation (24) shows that the jitter can be expressed as the sum of If 
independent samples from the set of block patterns. It is well known 
(central limit theorem) that such a sum has an amplitude distribution 
which approaches a Gaussian form as M becomes large. 

To see how well a distribution of this nature approaches the Gaussian 
shape, a sample problem was computed. The probability distribution of 
6; was assumed to consist of five equally spaced impulses of equal prob- 
ability. Actually, the distribution would probably be better spread than 
this. The value of AZ was taken as 50. This corresponds to a chain of 13 
repeaters with a Q of 100 and the block length taken as 8 bits. 

The cumulative probability deviation for this case is plotted on prob- 
ability paper in Jig. 6. The straight line represents a true Gaussian dis- 
tribution. Since the distribution is symmetrical, only half of it has been 
plotted. A normalized deviation of 1 corresponds to the standard 
deviation. Truncation of the example shown in Tig. 6 occurs at a nor- 
malized deviation of 10. 
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Fig. 6 — Amplitude distribution of jitter, true Gaussian and computed. 
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Fig. 6 shows that the distribution is Gaussian down to a cumulative 
probability of one part in 10!°. This seems to be sufficient justification 
for assuming a Gaussian distribution for long chains of regenerators. 

In summary, a random pattern introduces jitter with a flat spectrum 
into each regenerator. The power density of this jitter can be predicted 
from measurements with a set of repetitive test signals. 

The jitter at the end of a long chain of N repeaters is approximately 
Gaussian distributed, with an rms value which increases as the square 
root of N, without bound. The power density approaches a bound which 
is inversely proportional to the square of frequency. 


VI. EXPERIMENTS 


Measurements were made both in the laboratory and in a field in- 
stallation which are in agreement with the theory mentioned in the 
previous sections. 

In the laboratory, trains of one to ten repeaters were used, while trains 
of 14 to 84 repeaters were available at a field site located between Pas- 
saic and Newark, New Jersey. The laboratory repeaters were bipolar 
regenerators using ideal forward acting self-timing.’ The field repeaters 
were quite similar to these generally, but differed in their physical shape, 
and in a minor respect in their clock circuitry. The interrepeater spacings 
were controllable in the laboratory only. In the field, all equipment was 
installed in one central office, the signals being sent on repeatered lines 
to other offices and then returned on separate lines. The field situation 
was such that crosstalk and other interference was negligible. Since 
lines could not be terminated in the field conveniently at places other 
than central offices, the minimum number of repeaters that could be 
used was 14. All the different line lengths tested in the field were thus 
multiples of 14. 

The repeatered lines were driven by a word generator which could emit 
a train of bipolar pulses, and the necessary clock references for the phase 
measuring equipment. The generator could emit any of the fixed patterns 
possible with 8 digits, or a random pattern, or a combination of fixed and 
random. In addition, it could switch between two different fixed pat- 
terns at rates of several ke or less. A restriction on the patterns in this 
case was that all the pulses in the less dense pattern must also be present 
in the denser one. 

The phase detector used for measuring the jitter operated in the fol- 
lowing manner. The reference clock from the word generator was used 
to set a flip-flop which was reset by a clock derived from the last re- 
peater in the train. Jitter in this clock would cause the flip-flop to gener- 
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ate a duration-modulated pulse. To extend the measuring range, both 
clocks were divided by two before being sent to the flip-flop. This opera- 
tion did not reduce the amount of jitter presented to the instrument, 
however. The jitter information contained in the duration variations of 
the flip-flop output pulses was converted to corresponding amplitude 
variations by passing those pulses through suitable low-pass filters and 
amplifiers. This, of course, isa standard way of demodulating duration- 
modulated pulses. 

The jitter information, now in the form of voltage variations, was 
passed to a detector. For measuring fixed pattern shifts, an oscilloscope 
and camera were used. For random patterns, a true rms voltmeter and 
high-quality tape recorder served as detectors. 

A typical photograph of transitions between two fixed patterns is 
shown in Fig. 7. Note the close resemblance to the waveform derived 
from the model (Fig. 5c). It was shown in Section IV that the total 
change of phase at the end of N regenerators should be N times the 
phase difference between the two fixed patterns in question for one re- 
generator. Fig. 8 shows the total change of phase plotted against N for 
transitions between various fixed patterns. The relations are clearly 
linear. It is also apparent that pattern density is not the only factor in- 
fluencing phase shift, as was mentioned in Section III. Note for example 
the transitions from 10000000 to 11000000 (1/8 to 2/8) and 10000000 
to 10001000 (1/8 to 1/4). Both have equal density change but phase 
changes of different sign and amplitude. Examination of Fig. 8 would 
lead one to believe that the effect of intersymbol interference on phase 


PHASE LEAD ——~> 





Fig. 7 — Photograph of phase shift due to a transition from a 1/8 pattern 
(top) to an 8/8 pattern (bottom), with 84 repeaters; phase scale, 87.5 degrees/cm; 
time scale, 1 msec/cm. 
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Fig. 8 — Steady-state phase shift between fixed patterns. 


shift is comparable to the effect of density. The transitions from 1/8 to 
1/4 and 1/8 to 1/2 (10101010) involve patterns with little or no inter- 
symbol interference. A doubling of density between 1/4 to 1/2 causes 
about a 50 per cent increase in phase shift. Changing the pattern geome- 
try to 2/8, which has intersymbol interference, is sufficient to cause a 
change in the algebraic sign of the phase shift. In the case of the 1/8 to 
8/8 (all pulse) transition, the effects of density and intersymbol inter- 
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ference partially cancel each other, but the resultant shift indicates that 
the intersymbol interference is the larger. 

Equation (16) states that the rate of change of phase during the 
transition between two given fixed patterns is proportional to the half 
bandwidth of the timing filter and to the phase change between the two 
patterns. Restating this equation 


5 ou() = BO; — 8) = Beas (25) 
where A@ is the phase change per repeater between the two patterns, f, is 
the pulse repetition frequency (timing filter center frequency), and Q is 
the quality factor of the tank. Fig. 9 shows the measured rate of change 
of phase in degrees per millisecond plotted against the absolute value of 
the phase change per repeater. The slopes were obtained from photo- 
graphs of the transients during the various transitions. Using the known 
pulse repetition frequency, (25), and the best line fit from Fig. 9, it is 
possible to arrive at a value of Q for the average repeater tank. This is 
found to be 72. This figure is quite compatible with the expected values 
of loaded Q for repeater timing tanks (about 80). 

To check the effect of repeater spacing, the phase shift for one pattern 
transition (1/8 to 8/8) was measured in the laboratory with the re- 
peaters first correctly spaced at 6000-foot intervals, and then spaced at 
5000-foot intervals with no compensation. This is a more severe condi- 
tion than would be encountered in actual repeater practice. The data of 
Fig. 10 show that shortening the span has little effect on phase shift. 
Photographs showed the transient behavior to be unaffected also. 

The properties of jitter caused by a random pattern were investigated 
with a special quasi-random signal. The pattern was constrained to have 
one forced pulse followed by one forced space at the beginning of every 
block of eight slots, the remaining six spaces being filled with random 
pulses. Forcing the pulse was necessary to ensure that the clock of the 
self-timed repeaters did not die out. There is no particular reason for 
forcing the space except that it kept the over-all pulse density at one 
half. 

In testing the field repeatered lines with this quasi-random pattern, 
the output from the phase detector was recorded on tape and studied 
later with amplitude and frequency distribution analyzers and a true 
rms voltmeter. The results of these measurements are plotted in Figs. 
11-15, and will be used to corroborate the theoretical predictions of 
Section V. 

The measured spectral distributions of the output data are shown in 
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Fig. 9 — Measured rate of change of phase versus the absolute value of the 
phase change per repeater. 


Vig. 11. We have shown in Section V that the spectrum of random Jitter 
injected at each repeater is flat; therefore, the spectrum at the end of a 
chain of repeaters should have the shape of the corresponding jitter 
transfer function, plotted in Fig. 3. A comparison of Figs. 3 and 11 shows 
that they are in excellent agreement, even to the details of the nulls. A 
direct comparison of the measured and theoretical spectra for a chain of 
84 repeaters is shown in Fig. 12. The theoretical curve is given by (8). 
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Fig. 10 — Measurement of the effect of repeater spacing. 
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Fig. 11 — Measured spectral distributions for a random pattern (10 - -- - - - ): 
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Fig. 12— Comparison of measured and theoretical spectra for a chain of 84 
repeaters. 


The two parameters ® and B were adjusted to give the best fit. The cor- 
responding jitter injection density was 0.44 X 10~* degrees squared per 
cycle per second and the tank Q was 94 (compare a value of 72 from 
pattern transition slopes). 

Inspection of (8) shows that the jitter amplitude density at very low 
frequencies should be directly proportional to the number of repeaters. 
This is a consequence of assuming systematic jitter sources. The meas- 
ured low-frequency jitter density is plotted against the number of re- 
peaters in Fig. 18. The curve is indeed linear. The slope of this line can 
be used to find the power density of jitter injected at each repeater 
(0.44 X 10~-* degrees squared per cycle per second). The rms value of 
the jitter was measured for each chain of repeaters. The result is shown 
in Fig. 14. The solid curve is plotted from (21), with the product (®B) 
adjusted for a best fit. Using a value of 0.44 & 10~* for ® (from Fig. 13) 
the value of Q is found to be 69, in good agreement with the value 72 
found from slope data. The dotted line in Fig. 14 is the asymptote for 
long chains, given by (22). 

The amplitude distribution of the random jitter was also measured. 
All of the curves were Gaussian in form, with no indication of truncation 
within the limits of the amplitude analyzer. A typical distribution (54 
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Fig. 18 — Measured low-frequency jitter amplitude density versus the number 
of repeaters in the chain. 
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Fig. 14 — Root-mean-square jitter due to a random pattern (10------ ) ver- 
sus the number of repeaters in the chain, measured and calculated. 
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repeaters) is plotted on probability paper in Fig. 15. Note that the curve 
extends to amplitudes which have a probability of only 2 K 10~ of being 
exceeded, with no sign of truncation. 

The data described above establish all the predicted properties of the 
model. However, the important parameter ®, the jitter power density 
injected at each regenerator, has been measured using random patterns 
on long chains of repeaters. Equation (17) relates ® to measurements 
with simple repetitive patterns on a few repeaters. This allows the per- 
formance of long chains of repeaters to be predicted from short chains, 
measured in the lab. To test the method, a chain of 10 repeaters was set 
up in the lab, as described above. The average phase shift of these re- 
peaters was measured for every possible 8-bit repetitive pattern, as shown 
in Table I. Pattern 22 was used as a reference. 

The standard deviation of the phase shifts was found to be 2.225°, 
calculated according to (18). The weights used were those for the test 
random pattern. Using this value, (17) was used to find the power 
density ®, yielding a value of 0.513 & 10-* deg?/eps. Since the measured 
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Fig. 15 — Typical amplitude distribution of jitter due to a random pattern 
(10------ ) (64 repeaters). 
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value on long chains in the field was 0.44 & 10-‘ deg?/eps, this result 
validates the assumptions made in deriving the model and shows that 
simple measurements in the lab can adequately predict the behavior of 
long chains in the field. 


VII. ALIGNMENT JITTER 


We have shown that jitter is unbounded; however, we will now show 
that the nature of jitter is such that it will not contribute strongly to 
digital errors in the regenerator chain. 

Alignment jitter has been defined by Rowe’ as deviations in alignment 
between the input signal pulse and the corresponding timing pulse. In 
terms of our model, the output jitter of one repeater is the input jitter 
for the next, and the timing jitter of a repeater becomes its output jitter. 
Therefore, we will redefine alignment jitter in a particular repeater as 
the difference between its output jitter and the output jitter of the pre- 
ceding regenerator: 


Ban = On — On-1 (26) 
Substituting from (3) 
1 N 
Oaw(s) = O(s) Gsm) (27) 


Since the jitter injected at each repeater is bounded, we can show by 
a straightforward analysis that the alignment jitter is bounded, and 
further, the bound does not increase as the length of the chain of repeaters 
increases. (See Appendix D.) These properties are easily seen if we con- 
sider simple pattern transition. In that case the alignment jitter in- 
creases from zero to a value equal to the phase shift due to the final 
pattern. (See Fig. 16.) An important assumption in our discussion of 
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Fig. 16 — Alignment jitter for a fixed pattern. 
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alignment jitter is that the frequencies and Q’s of the timing filters are 
identical. 

The error margin for random noise interference has been measured at 
the first and last repeater in a long chain of repeaters in the field. No 
significant difference was found. In addition, analog measurements on 
voice channels in T1 carrier terminals showed no noticeable change as 
the length of the repeatered line coupling these terminals was increased 
from about 11 to about 1380 miles (14 repeaters to 154 repeaters). 

It is clear that even though jitter becomes quite large in long chains, 
the error margin of the repeaters does not decrease. 


VIII. SUMMARY 


We have measured the jitter in a field installation of the T1 24-channel 
PCM carrier system. The maximum number of repeaters in tandem was 
84. The jitter was produced by pattern variations in the signal. 

The largest jitter observed is produced by a sudden transition from 
one repetitive pattern to another. This causes a steady-state phase shift 
which increases linearly with the number of repeaters. The amount of 
the shift depends on the patterns used and is of the order of two degrees 
per repeater. The rate of change of the phase shift also depends on the 
patterns used and is of the order of 2000 radians per second. This phase 
shift slope becomes independent of the number of repeaters. These 
measurements are consistent with the work of R. C. Chapman. 

Jitter produced by random signals has the character of noise. The root 
mean square phase deviation after 84 repeaters is about 10°. For long 
chains of repeaters the rms jitter increases in proportion to the square 
root of the number of repeaters. At a given frequency, the jitter power 
density approaches a limit as the number of repeaters grows large. How- 
ever, at low frequencies this limit is very large and is not approached for 
a reasonable number of repeaters. This low-frequency jitter power 
density is directly proportional to the number of repeaters. Therefore, 
for long chains of repeaters, the jitter is concentrated at low frequencies. 
For 84 repeaters the jitter power is mostly below 400 cycles per second. 
It is interesting to note that this is much less than the half bandwidth 
of the timing filters, which is about 10,000 cycles per second. The 
amplitude distribution of jitter is Gaussian. 

We have derived a model which is in agreement with our measure- 
ments. This model assumes that the causes of jitter are systematic: 
that is, the same at each repeater. While nonsystematic effects are un- 
doubtedly present, their jitter does not accumulate as rapidly as that 
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due to systematic sources and is not significant in long chains. We further 
assume that only the low-frequency components of jitter are important, 
since high-frequency components are strongly attenuated by the timing 
filters. The model quantitatively predicts the measurements of random 
jitter from measurements of the steady-state phase shift of 8-bit repeti- 
tive patterns. 

We have shown that the model is valid for many sources of jitter, 
including amplitude-to-phase conversion, intersymbol interference, and 
finite width effects. It is also valid for the effects of mistuning in very 
long chains of repeaters, although mistuning was not an important 
source of jitter in the repeaters we measured. 
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APPENDIX A 
Amplitude and Jitter Transfer Functions of a Single-Tuned Tank Circuit 


The following argument is based on work by S. O. Rice on the analysis 
of FM systems. The timing wave is extracted from the signal by a high-Q 
single-tuned filter. Therefore, for the input to this filter, we need only 
consider that component of the signal which is near the bit rate. This 
component is amplitude modulated by the pulse density and phase 
modulated by the jitter. 

Therefore, let the signal incident on the filter be given by the real part 
of 


v(t) = exp [jot + j0(t) + a(t)] (28) 


where w, is the bit frequency, and @(¢) and a(t) are arbitrary functions 
of time with the restriction that their time derivatives are small. 
We will write the output of the filter as 


Vo(t) = exp [Jct + J80(t) + ao(t)]. (29) 


Further, if we express the impulse response of the filter as 


a(t) = | VCju)e* af (30) 
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we have 
v(t) = i: exp [jwe(t — «) + j0(t — x) + a(t — x)]g(a) dx. (31) 


Equating (29) and (31), we have 


J9.(t) + ao(t) 
e (32) 
= in [ exp [ja(t — x) + a(t — x) — ju,rlg(a) dx. 


We will assume that the transfer function of the filter is zero for fre- 
quencies above (f. + r) and below (f, — 7), for some r. When we impose 
this condition on (80) and make the change of variable 


we have, to a high degree of approximation 
fte(t) + a(t) = Im [exp (ja(t — 2) + alt — aha) de (34) 
where 
nin) = x [YC jie + jo!) "de (35) 


For a tuned tank 


, 1 
Y (jo. + jo) & ———— 
(Jee + jw) Sz + Flo/B) (36) 
where B is the half bandwidth. 
For a high-Q filter we may let 7 in (35) go to infinity, whereupon 
h(z) = JO, 7) 
erg aes Se0, 30) 


Now we return to the evaluation of (34), which is our problem. We 
will approximate 


pitlt—a)ta(l—2) _ pa+50(0) s (—2)" F, (t) (38) 
0 n!} 
where 


F,,(t) = pore a (39) 
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We now define 
oO 
Mn = 7 x"h(x) dx 
— 00 


or 
mM, = niB”. 


Then from (38) and (40) we have 


i exp [70(¢ — x) + a(t — x)]h(x) dx 


= pita) {1 a y = Oye 
n=1 . 


Now in general 


n nt” 
In (: a. > = 


t 4 2 
= 77a ta (oe — ow) 


3 





+ & (as — Bara + 2x) + ++, 


3! 


So if we take 1 = —1, a, = m,F,,(t), we have 


In ie exp [jo(t — x) + a(t — x)]h(x) dx 


ll 


JO.(t) + a(t) 


_ mr; 
1! 


I 





a(t) + jo(t) + | + 3 (mal, — mi Fy) 


1 
ae 31 (msF’s — 38mm FF, + 2mF;’) + —-- | . 


(40) 


(41) 


(42) 


(43) 


(44) 


Now we replace the F,, by their values from (39), and after some manipu- 


lation we have 


au(t) + 06(t) = [ (galt — 2) + alt = 2)Ih(o) dx 


(45) 


+ (terms in powers of the derivatives of 6(¢) and 


a(t) normalized to the half bandwidth) 


+ (cross products of the derivatives of 6(t) and 


a(t) normalized to the half bandwidth) 
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where we have made use of the fact that 


[a(t) + j0(t)] — mila’(t) + jo’(t)] + > [a”(t) + jo"(é)]) + ++: 
: (46) 
= i late ay 90 ae) nde 


Now, if we neglect powers and cross products of all the derivatives of 
6(t) and a(¢), and use the value of h(x) from (37), we have 


a(t) + 76,(t) = : Be"*"a(t — x) + j0(t — x) dx. (47) 


Equating real and imaginary parts and recognizing the convolution 
integral, we have 


1 
0.(s) = TaG/B) (48) 
and 
1 
A,(s) > T+ (By 4): (49) 


The relationship given in (48) is one of the results we are seeking. 
The amplitude modulating signal, however, is e*, not a(t). If now we 
restrict a(t) to be small, then 


ef m1 + a(t). 


From (47) it is clear that if a(t) is small, then a,(t) is small, and then 
the output amplitude modulation 


eo) ow 1 + a,(Z). 


Then we see that (49) gives the relationship in the transform domain 
between the varying term of the input AM and the varying term of the 
output AM. 


APPENDIX B 


Power Density of a Jitter Produced by a Random Pattern 


The jitter 6(t) produced by a series of 8-bit blocks may be represented 
by 


a(t) = S2G,A(t — n8P) (50) 
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where 
T, is the bit interval, 
6, is the phase shift that would be produced by the pattern of block 
n, in a repetitive pattern, and 


A(t)=1, —-47, <t< 47, 
0, lel > 47;. 


The patterns in successive blocks are statistically independent, and 
6, can be taken to have zero mean. When we add an arbitrary starting 
time we make the process whose sample functions are given by (50) 
stationary. Then the autocorrelation function can be shown to be 


V(r) = o A(r) (51) 
where 
ar) = (1435), —8T, <7 <0 
T 
= aoe, < 
(1 mn) , 0x7 < 87, 
= 0, |r| = 87, 


where 7 is the time difference, and o is the standard deviation of 6p. 
The one-sided power density is given by 


B(w) = of W(r)e*" dr. (52) 


Substituting (51) in (52) gives 


4o° sin’ w4T, 
Applying L’Hospital’s rule we obtain 
€(0) = 2(87.)o°. (54) 


Furthermore, it is clear from (53) that at zero frequency the slope of the 
power density is zero, and because 7’, is small for the system under con- 
sideration we may assume to a very high degree of accuracy that over 
the band of frequencies which are of interest, the power density is flat. 

The values of 6, have a discrete probability distribution, indicated in 
Table I. Therefore, 


1 = 1 mb, 
¢ = u x Di (3, — it 2X pi) (55) 
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where p; is found by dividing the weights by the sum of the weights. 
M is the total number of distinct patterns. 


APPENDIX C 


Integration of the Jitter Spectrum 
We are indebted to A. J. Goldstein for the following integration. 
The spectrum of jitter at the end of a chain of N repeaters was given 
in the section on the model in (8). 


The mean-square jitter is given by 
Bie = = / by(w) dw 
2a Jo 
o [°R 


Qe Jo we 


1 N 
7 Ce: iam) 
eB f° 1 


1 N}2 
lar Jo w i=-(45) 


Ov = ®BP(N) (58) 


2 


dw (57) 











2 




















where 
1 {4 | ae a 
pan = xf S|! (Gp) a 
ee haa | 1 - 1 | 
pa) = 3 alt- (es) |L}- Ge) | 





Let us introduce the dummy variable a and define 


S(a) = 3 P(N) a™ 


N=0 


-/ 52 Gs -(r#e) - (re) oe 
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We recognize that each of the terms in the summation of (60) is a 
geometric series whose sum is given by 

















r= : : (61) 
0 l1-—r 
Using this, we can write 
2r4o w| l—a fi cance _ a 
1 + Jw 1 — Jw 
eee ee 
i_—2 
1+ (62) 
daly at aol 2a(1 — a) 
Qr/o Ll —ae® ow? + (1 — a)? 


a 
recente 
All of the terms above can be integrated by using standard tables. 
The result is 
_< v_1 Gd a 
i Ake =a)? Sian: ou 


If we expand the right-hand term of (63) in a power series, the coeffi- 
cients of the a” will be the P(N) 


al 1 (2N —1)! 
POO = 31% ~ 3 acme = ip Ss 





For values of N greater than 5 we can approximate the factorials by 
Stirling’s formula 


nl VV 2nx (n/e)”. (65) 


The result is 


i 1 NS 3 ) 
Pw) 5 (N ~ See): Sc 
For N > 10 we can make the further approximation 


P(N) wh (n z Rv¥). (67) 
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And for N > 100 
P(N) & N/2. (68) 
Values of P(N) from 1 to 100 are listed in Table IT. 


The mean-square jitter at the end of a long chain of repeaters is ap- 
proximately [using (58) and (68)] 


— BN 


On? & 5 





(69) 


This last approximation can also be derived from direct integration 
of (9). 


APPENDIX D 


Growth of Alignment Jitter with Number of Repeaters in Chain 


The alignment jitter at the nth repeater is given by (27) 


1 
Ouv(s) = O(s) G+ G/B))*? (70) 
where O(s) is the jitter injected at each repeater. 
Equation (70) can be written in the iterative form 
1 
Oav(s) = Oaw—1)(s) 1 + (s/B) . (71) 
In the time domain equation (71) becomes: 
t 
dax(t) = | Baw (b— 7)Be™ dr. (72) 
0 
Then, 
| Dan (t) | < Myi(1 = a) 
or 
| Qav(t) | S Mya, (73) 


where M,,_1 is the maximum value of | @.;v-1)(¢) | on the interval. 
Carrying the iteration back to the first repeater, we have 


| Gaw(t) | S$ Mo(1 — e**)* 
or 
| Gaw(t) | = M. (74) 
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where 
M, = | 6(t) | 


where @(t) is the jitter injected at each repeater. 

Equations (73) and (74) state that the maximum alignment jitter 
does not increase as the number of repeaters grows and is never greater 
than the maximum jitter injected into each repeater. Since the jitter 
injected at each repeater is limited by the circuitry to a fraction of a 
cycle, alignment jitter can never be greater than a fraction of a cycle. 
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Satellite System Interference Tests 
at Andover, Maine 


By HAROLD E. CURTIS 
(Manuscript received July 3, 1963) 


Because of the shortage of suttable frequencies, 21t appears that it will 
be necessary for satellite communication systems and ground microwave 
systems to operate in the same frequency band. To do this without excessive 
interference between them requires that stringent criteria be met. This paper 
presents the results of radio interference tests from two test transmitter sites 
into the Andover, Mazne, earth station receiver for experimental satellite 
communications. Although these sites were only 23.5 miles and 55 miles 
from Andover, an analysis of the results indicated that they could be used as 
locations for ground microwave systems with few limitations. 


I. INTRODUCTION 


In order that satellite communication systems and terrestrial micro- 
wave systems may share successfully the same frequency band, it is in 
general necessary that the earth station of the satellite system and 
microwave station be separated physically so that line-of-sight trans- 
mission between them is not possible. Under these conditions, either 
the scatter mode or diffraction mode of transmission predominates, 
depending on the path profile. While a considerable amount of data on 
such transmission is available with transmitting and receiving antennas 
pointed at one another and only elevated sufficiently to graze the horizon, 
very little data have been taken with one of the antennas, the earth 
station antenna in this case, randomly oriented in azimuth and elevation. 

Predictions!:? have been made that it should be possible to operate 
point-to-point microwave systems in the same radio frequency band as 
the satellite system at distances of some 45 to 150 miles removed from 
the earth station, or even less, provided that (a) the earth station antenna 
is not operated at an elevation angle less than some minimum value, for 
which 7.5 degrees has been suggested, (b) suitable terrain blocking 
exists, and/or (c) a certain amount of angular discrimination is pro- 
vided by the antenna of the radio relay terrestrial station. 
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Thus, one purpose of the tests to be described and evaluated was to 
examine critically the received interfering signal strength as a function 
of elevation and azimuth angle of the earth station antenna beam. A 
second purpose was to record for an extended length of time the received 
signal with the earth station antenna elevated in angle somewhat above 
the elevation value for maximum received interference to see if reflec- 
tions from cloud masses, airplanes, birds, etc., could produce signal 
enhancements of sufficient magnitude and frequency of occurrence to 
degrade significantly the performance of the satellite communication 
system. 

The American Telephone and Telegraph Company already had com- 
mercial operating sites at Cornish, Maine, and at West Paris, Maine — 
55 miles and 23.5 miles, respectively, from the earth station at Andover, 
Maine — and these appeared to be suitable locations at which to place 
test transmitters for the interference tests. A contour map of the area 
is shown on Fig. 1. The interference path profiles (Figs. 2 and 3) indi- 
cated that these sites would be appropriate for evaluating the required 
miminum antenna discrimination toward Andover of an operating 
TD-2 station’ at these sites and/or the required limitation on minimum 
antenna elevation at Andover for compatible operation on a frequency 
sharing basis. 

Although the results of these tests and the conclusions based thereon 
may not be valid for other sites and equipments, they do show that it is 
possible by careful choice of the earth station site to operate microwave 
radio stations in the same frequency band without undue physical 
separation between them. 

Because of the considerable international interest in the question of 
frequency sharing, the criteria of permissible interference are those 
recommended by the Xth Plenary Assembly of the International Radio 
Consultative Committee (C.C.I.R.) held in 1963. 


II. RECORDING SETUP 


A sketch of the experimental setup for recording the received signal 
at Andover for the various test conditions is shown on Fig. 4. It con- 
sisted of the earth station antenna whose output was fed to a maser 
followed by a receiver-converter, pre-amplifier and main IF amplifier 
normally associated with the earth station receiver. At this point a 
4170-mc RF input signal appeared as a signal whose nominal frequency 
was approximately 74.1 mc. The dynamic range of the entire receiver 
from antenna to strip chart was approximately 40 db when the gains of 
the various amplifiers were properly adjusted. This IF signal was then 
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Fig. 2 — Profile of West Paris-Andover path. 


detected by a variable frequency analyzer with a 10-ke bandwidth and 
the rectified output recorded on a strip chart after passing through a 
logarithmic amplifier. No automatic gain control was used, so that the 
amplitude of the 74.1-me signal was directly proportional to the ampli- 
tude of the 4170-me input signal, within the boundaries of noise and 
overload. 

Provision was made, as shown on I'ig. 4, for calibrating the recorder 
in terms of received RIF power. The chart range was approximately 40 
db, but the over-all sensitivity of the setup could be changed, if required, 
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Fig. 4 — Test setup. 


by the insertion of an IF pad of suitable value. The total range covered 
was from about —142 dbm (noise only) to about —70 dbm. The fre- 
quency response of the receiving system was limited by the strip chart 
recorder and was in the order of 60 cycles per second. 

It will be noted that the gain of the RF-IF portion and the sensitivity 
of the analyzer-recorder portion of the system were calibrated separately. 
This was necessary because the RF calibrating oscillator was not stable 
enough in frequency to stay within the 10-ke band of the analyzer. How- 
ever, the frequencies of the test transmitters at Cornish and West Paris 
were adequately stable, and only occasional checks of the tuning of the 
analyzer were required during an extended run. 

All of the values of received signal power given herein are, strictly 
speaking, referred to the input point of the maser amplifier. In view of 
the fact that the loss of the waveguide and associated circuitry con- 
necting the maser to the receiving antenna is only about 0.1 db, they 
are also essentially the same as would have been observed at the antenna 
output. Table I presents information pertinent to these tests. 


III. PATH PROFILES 


The profiles of the West Paris-Andover and Cornish—Andover paths 
are shown on Figs. 2 and 3, respectively. An analysis of the first path 
indicates that, because of the mountain 4.4 miles from Andover, the 
diffraction mode would be expected to predominate over the scatter 
mode. The opposite is true for the Cornish—Andover path. 

The Andover antenna, as will be noted on Fig. 1, is located in a rela- 
tively flat area surrounded by low mountains. Elevation scans of noise 
only were available at 5-degree intervals in azimuth using the Andover 
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TABLE [— PERTINENT INFORMATION 





Station Andover, Maine West Paris, Maine Cornish, Maine 

Status Earth Station Test transmitter | Test transmitter 
Latitude 44° 37’ 59” 44° 17’ 48” 43° 50’ 16” 
Longitude 70° 41’ 52” 70° 87’ 13” 70° 49’ 48” 
Path length — 23.5 miles 55.0 miles 
Frequency source — Crystal oscillator | Crystal oscillator 
Frequency, RF — 4170 me 4170 mc 
Antenna diameter} 68 feet 10 feet 16 feet 

(actual) 
Effective radiated — +64 dbm +78 dbm 

power 
Antenna gain (free-| 54 db* 40 db 43 db 

space) 
Orientation 

Azimuth Variable Toward Andover | Toward Andover 
Elevation Variable Toward horizon Toward horizon 

Bandwidth of test 10 ke — — 

setup 
Noise (at zenith) —142 dbm — — 
Elevationt — 1.57° 2.43° 
Azimutht — 170.69° 184.92° 


* The Andover antenna is arranged normally to receive a circularly polarized 
wave, and for this condition the maximum response is 57 db referred to an iso- 
tropic antenna. The response of the antenna to a linearly polarized incident wave 
under the same condition is 3 db less. 

} These are the elevation and azimuth angles of the Andover, Maine, antenna 
for maximum received signal as determined experimentally. The elevation angle 
is referred to the horizontal in all cases unless specifically stated otherwise. 


antenna, and the elevation angle of the antenna at which the noise 
abruptly changed was noted as the “electrical”? horizon. A 360-degree 
azimuth profile thus obtained is shown on Fig. 5. 


IV. ANTENNA DIRECTIVITY PATTERNS 


Figs. 6 and 7 show vertical and horizontal directivity patterns about 
the main beam of the 68-foot horn-reflector antenna at Andover, as 
measured by J. N. Hines of Bell Telephone Laboratories. A circularly 
polarized signal was transmitted from a reference point at Black Moun- 
tain about 4.9 miles away, and the vertical component of the electrical 
field out of the antenna was recorded as the beam of the antenna at 
Andover was swung in azimuth or raised in elevation about the bearing 
of the reference point. The azimuth and elevation angles of the reference 
point relative to the Andover antenna were 142.097 degrees and 3.977 
degrees, respectively, and the path was line-of-sight. 

The maximum response of the antenna arranged to receive in this 
fashion is about 54 db referred to an isotropic antenna. It will be noted 
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Fig. 5 — Azimuth profile based on noise measurements. 


that the antenna beam had to be elevated about 10 degrees relative to 
the reference direction, in order to reduce the gain to that of an isotropic 
antenna. 

Since the noise power in the measuring equipment was also about 54 
db below the maximum received signal, the maximum value of antenna 
directivity discrimination that could be measured was also limited to 
this same value. 


V. EXPECTED RECEIVED SIGNAL 


5.1 West Parts-Andover Path 


1. Transmitter power +27 dbm 
2. Waveguide and filter loss 3 db 
3. Antenna gain — vertical polarization = 40 db 
4. Effective radiated power = +64 dbm 


| 
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Fig. 6 — Antenna directivity in vertical plane. 
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Fig. 7 — Antenna directivity in horizontal plane. 


5. Estimated knife-edge diffraction basic transmis- 

sion loss* (Median scatter loss estimated as 198 db 

and hence not controlling.) = 168 db 
6. Receiving antenna gain (see note on Table I) = +54 db 
7. Estimated signal power at antenna output on basis 


of knife-edge diffraction = —50dbm 
5.2 Cornish-Andover Path 
1. Transmitter power = +37 dbm 
2. Waveguide and filter loss = 2 db 


* Defined in Ref. 4, ‘Basic Transmission Loss (Ly). The basic transmission loss 
(sometimes called path loss) of a radio circuit is the transmission loss expected 
between ideal, loss-free, isotropic, transmitting and receiving antennae at the 
same locations as the actual transmitting and receiving antennae.”’ 
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3. Path antenna directivity gain.* Experience on 
similar scatter paths indicates that the effective 
antenna gains on such paths are substantially less 
than the free-space gains of 54 db and 48 db for 
the two antennas. This reduction, which is usually 
referred to as the antenna-to-medium coupling 
loss, is estimated to be 11 db for the larger antenna 
and 3 db for the smaller antenna. The path an- 


tenna directivity gain is therefore = 83 db 
4, Estimated median basic transmission loss (scatter 

propagation) during November = 220 db 
5. Estimated received signal power at antenna out- 

put on basis of scatter propagation = —102 dbm 


The transmitter power at a standard TD-2 station’ is +27 dbm, and 
the waveguide and filter losses total about 3.5 db. A horn-reflector an- 
tenna of 10-foot aperture and gain of approximately 40 db at 4200 mc 
is also standard equipment. The transmitted wave is linearly polarized. 
Thus the effective radiated power at Cornish was about 15 db greater 
than that of a normal TD-2 transmitter. 


VI. EXPERIMENTAL RESULTS 


6.1 West Paris-Andover 


As noted above, the predominant mode of transmission over this path 
was expected to be by diffraction rather than by scatter, and hence a 
comparatively steady signal was expected. 

The principal tests over this path may be divided into two parts, (2) 
“fine grain” elevation runs at the azimuth angle corresponding to maxi- 
mum signal when “on-beam,”’ and (iz) “fine grain” azimuth scans at vari- 
ous fixed elevation angles. Certain runs, each of about one hour in dura- 
tion, were made at various fixed elevations, all on the azimuth bearing 
of maximum received signal, to establish the fact that the signal was 
steady in each case. 

The maximum received signal on this path ranged from about —75 
dbm to —80 dbm during the various days of the tests. The expected 
signal as computed above, based on knife-edge diffraction, was —50 

* “Path Antenna Directivity Gain (G,). The path antennae directivity gain is 
equal to the increase in the transmission loss when lossless, isotropic antennae 


are used at the same locations as the actual antennae.’’ Transmission loss L = 
Lyn — G, . See Ref. 4. 
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dbm, indicating that the intervening hill was considerably more effec- 
tive than an ideal knife edge in attenuating the signal. On the basis of 
the received signal levels the basic transmission loss, as defined above, 
was about 193 db for the West Paris—Andover path. Since this was fairly 
constant during the period of any single test, the variations in received 
signal power as the direction of the beam of the Andover antenna was 
changed reflected changes in the path antenna, directivity gain. 

In view of the fact that the basic purpose of this study is the evalua- 
tion of three specific sites from the standpoint of frequency sharing 
between satellite communication systems and ground microwave sys- 
tems, the results are plotted in terms of actual received interfering power 
at the site in question. 

Figs. 8 and 9 show fine structure plots of vertical scans during clear 
weather and during a heavy rain, respectively. Fig. 10 presents a smooth 
curve of the envelope of the peaks of the interference up to an elevation 
of 50 degrees. 

Figs. 11 and 12 are fine grain azimuth scans at fixed elevations of zero 
and +2 degrees, relative to that of maximum signal, i.e., 1.573 degrees 
above the horizontal. Fig. 13 is a similar scan with the antenna elevated 
8.573 degrees above the horizontal. 

Finally, Figs. 14 and 15 present 360-degree azimuth scans at fixed 
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Fig. 8 — Elevation scan, West Paris transmitter activated (vertical scan 
showing fine structure). Signal is steady at any elevation of antenna. 
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Fig. 9 — Elevation scan, West Paris transmitter activated (vertical scan show- 


ing fine structure). Signal is steady at any elevation of antenna. 


elevations. In these figures the maximum signal values in sectors of vari- 
ous widths are plotted. It will be noted on Fig. 14 that the signal, when 
the antenna was directed at about 180 degrees from the direction of 
West Paris, was only about 10 db below the signal received when pointed 
directly at West Paris. A similar reflection down 14 db was noted when 
the Cornish transmitter was activated and the Andover antenna rotated 
through 360 degrees of azimuth at an elevation angle of 2.43 degrees. 
The source of the reflections is as yet unidentified. 
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Fig. 10 — Elevation scan, West Paris transmitter activated (envelope of peaks 
maximum signal). 
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Fig. 11 — Azimuth scan, West Paris transmitter activated (receiving antenna 
elevated for maximum signal, i.e., 1.573° above the horizontal). 
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Fig. 12 — Azimuth scan, West Paris transmitter activated (receiving antenna 
elevated 2° above that for maximum signal, i.e., 3.573° above the horizontal). 
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Fig. 138 — Azimuth scan, West Paris transmitter activated (receiving antenna 
elevated 7° above that for maximum signal, i.e., 8.573° above the horizontal). 
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In addition to the elevation and azimuth scans such as were also made 
on the West Paris-Andover path, runs totaling 107 hours were made 
with the Andover antenna elevated in angle sufficiently to bring the 
median signal down close to the level of the noise in order that periods 
of possible signal enhancement could be accurately observed. The azi- 
muth angle at all times was that for maximum signal when ‘‘on-beam,” 
i.e., 184.92 degrees. 

When on the coordinates of maximum interference, the signal was 
steady within a db, and measured about —108 dbm at the input of the 
maser, aS compared with the expected value of —102 dbm. Thus the 
basic transmission loss of the path at this time was 226 db based on an 
estimated path antenna dircctivity gain of 83 db. Elevating the receiving 
antenna less than a degree altered the character of the signal so that it 
varied continually about +10 db. Typical portions of the strip chart 
traces are shown for several antenna elevations on Fig. 16. 

Fig. 17 shows the results of an elevation run giving median values of 
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Fig. 15 — Azimuth interference profile, West Paris transmitter activated 
(receiving antenna elevated 7.5° above the horizontal). 
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Fig. 16 — Received signal from Cornish for various antenna elevations at 
Andover (elevation angles are in degrees above that for maximum signal). 


interference at intervals of 0.2 degree in elevation. The variation in sig- 
nal strength was so great that it was impossible to reset the antenna 
elevation to a particular angle and obtain the same interference level to 
within the order of +5 db. A reproduction of a portion of the strip chart 
from which the results on Fig. 17 were obtained is shown on Fig. 18. 
Here the rate of elevation scan is 0.048 degree per second. Also shown on 
Fig. 18 for comparison is an elevation scan made at Andover with the 
West Paris transmitter activated. 

An azimuth scan with the antenna beam elevated 7.5 degrees above 
the horizontal with the Cornish transmitter activated showed that the 
interference power was below the noise, i.e., —141 dbm, for all angles 
except in the direction of Cornish. Here the interference power reached 
a maximum of —132 dbm. 

Fig. 19 shows a running plot of hourly median received signal power 
at Andover extending over a period of nine days, the actual measuring 
time totaling 107 hours. Furthermore, since the main interest was in 
signal enhancement, several different elevation angles were used during 
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the period. Thus the changes in average hourly median from day to day 
reflect to a considerable extent changes in path antenna directivity gain. 

Signal enhancements in which the median power rose 7 db or more 
for brief periods were individually examined, and the median value at 
the time of maximum rise relative to the normal median signal level at 
the time was tabulated. The duration of each enhancement at a level 
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_ Fig. 17 — Elevation scan, Cornish transmitter activated (Andover antenna 
directed toward Cornish; data from Fig. 18). 
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Fig. 18 — Elevation scans (elevation angles are in degrees above that for maximum signal’ in each case; azimuth angles are 
fixed at value giving maximum signal at reference elevation in each case; elevation scan rate is 0.043° per second). 
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Fig. 19 — Hourly median received signal at Andover from Cornish (elevation angles are referred to that for maximum re- 
ceived signal, i.e., 2.483° above the horizontal). 
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halfway (in db) from the normal median to the peak median value was 
also recorded. 

The short-term, e.g., five seconds, median varies normally about +3 
db in any brief period such as an hour, and such normally expected in- 
creases associated with these variations were not classified as enhance- 
ments. Three typical periods that were classed as enhancements are 
shown on the strip chart reproductions on Tig. 20. Here the chart speed 
is one small division per second. In the top example on this figure the 
average median is taken as —121 dbm, and the median at the peak as 
—106 dbm, and the enhancement is thus 15 db. The enhancement in 
the middle example is also 15 db, and for the bottom is 20 db. 

A total of 56 enhancements thus defined occurred in the 107-hour 
period. Fig. 21 shows a plot of the cumulative percentage of the total 
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Fig. 20 — Typical periods of signal enhancements — Cornish-Andover path 
alevation angles are in degrees above that for maximum signal). 
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SIGNAL ENHANCEMENT IN DECIBELS 


Fig. 21— Probability of signal enhancements — Cornish-Andover path 
(cumulative percentage of time the signal strength was enhanced any given 
amount above the median, normal variations in median excluded; duration of 
observations 107 hours). 


period of 107 hours the signal was enhanced, as defined above, any 
given amount or more above the median. Thus 0.01 per cent of the 107 
hours the signal was enhanced 27 db or more above the median signal 
value. 

Fig. 22 shows the distribution of the duration of signal enhancements. 


VIT. INTERPRETATION OF RESULTS AND CONCLUSIONS 


7.1 Cornish-Andover Path 


Recommendation No. 356* of the Xth Plenary Assembly of the 
C.C.1.R. held in 1963 provides that the mean value of interference from 
radio relay systems into an earth station receiver in any hour should 
not exceed 1000 picowatts, psophometrically weighted. On the basis of 
four exposures into the earth station receiver, the message channel inter- 
ference per exposure is 250 picowatts or 18 dba at reference level. 

The signal-to-interference ratio at baseband may be related to the 
ratio of the desired carrier to the interfering carrier by the Receiver 
Transfer Characteristic as defined in Recommendation No. 356. The 


* This will appear in the forthcoming publications of the Xth Plenary As- 
sembly, Geneva, 1963. 
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Fig. 22 — Duration of signal enhancements — Cornish-Andover path (per- 
centage of enhancement periods having durations equal to or greater than values 
shown on abscissa; sample of 56 enhancements in a total of 107 hours). 


magnitude of the Receiver Transfer Characteristic depends upon the 
frequency separation of the two carriers, the characteristics of the modu- 
lation, and the radio receivers themselves. Fig. 3 of Ref. 2 shows com- 
puted values of this relationship for a possible satellite system, a T'D-2 
system and a TH system.® Here the signal is Gaussian noise having the 
same power per channel as an average talker, and the interference is 
expressed in dba at a zero-db transmission level point, or reference level. 
Thus we find that the mean value of the ratio of the desired carrier to 
the interfering carrier should be 25 db or greater in order that the base- 
band noise not exceed 18 dba at reference level. A reasonable value for 
the received carrier power at the earth station receiver from the satellite 
is about —98 dbm. The interfering carrier should then be —123 dbm 
or less. 
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A signal power of about —128 dbm was observed on the average when 
the Andover antenna was directed at the Cornish transmitter and ele- 
vated 7.5 degrees above the horizontal. Since the power into the trans- 
mitting antenna was +35 dbm, the transmission loss is 163 db. Because 
of higher power and antenna gain, the effective radiated power of the 
Cornish test transmitter was about 15 db greater than that of a standard 
TD-2 transmitter. Therefore it may be concluded that a TD-2 trans- 
mitter could be operated at Cornish, Maine, with its antenna directed 
toward the earth station, and the mean interference during the period 
of measurement would have been 20 db below the permissible value 
recommended by the C.C.I.R. 

Recommendation No. 356 also provides that the message channel 
interference into an earth station receiver should not exceed 80,000 
picowatts, psophometrically weighted, or 43 dba at reference level, for 
more than 0.02 per cent of any month. On the basis of four interferences 
per satellite ground station, the probability of occurrence per exposure 
should not exceed 0.005 per cent. 

Reference to Fig. 21 shows that enhancements of 33 db were observed 
during the test period with a probability of 0.005 per cent. This leads 
to an expected interference value of —110 dbm, or a ratio of desired to 
interfering carrier power of 12 db, at the antenna output of the satellite 
ground receiver. This corresponds to 31 dba, or 5000 picowatts, a value 
well within the above objective developed from the C.C.I.R. recom- 
mendation. 

The earth station transmitter at Andover operates in the 6-kmc com- 
mon carrier band, and therefore it could potentially interfere with a 
common carrier receiver such as the TH microwave system operating 
in this band. While no companion interference measurements were made 
in this frequency range, it is quite practical to estimate with adequate 
accuracy the interference from Andover that might fall into a TH re- 
ceiver that might be located at West Paris or at Cornish. 

Recommendation No. 357 of the Xth Plenary Assembly of the 
C.C.I.R. provides that the interference from an earth station transmitter 
into a radio relay system should not exceed in any hour 1000 picowatts, 
psophometrically weighted. Again allowing four interferences per relay 
route from such a source, the interference per station becomes 18 dba 
at reference level. Fig. 3 of Ref. 2 shows that for this value of interference 
from the earth station transmitter, the mean value of carrier-to-inter- 
ference ratio at the converter of the TH receiver should not be less than 
58 db, providing the earth station transmitter frequencies are inter- 
leaved between the TH frequencies. 
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Since the normal received carrier power of the TH receiver is —27 
dbm, the interfering carrier must be —85 dbm or less. Although the 
measurements were made at 4170 me, the estimated system loss is less 
than one db different at 6000 mc. Thus the transmission loss with the 
earth station antenna elevated 7.5° is about 163 db, as before. There- 
fore, the power into the earth station antenna at Andover could be as 
great as 48 dbw or 63,000 watts without exceeding the mean interference 
recommendation. 

Recommendation No. 357 also provides that the interference into a 
relay system should not exceed 50,000 picowatts psophometrically 
weighted one minute mean power for more than 0.01 per cent of any 
month. On the basis of four exposures, this reduces to 50,000 picowatts 
for not more than 0.0025 per cent of any month. 

The latter permissible power recommendation is 23 db greater than 
that for the per-exposure objective for a mean power of 250 picowatts 
as developed above. Fig. 21 shows that signal enhancements of 38 db 
were observed for 0.0025 per cent of the observational period. This 
implies that for these specific sites the 0.01 per cent recommendation 
is 15 db more restrictive on earth station power than the mean inter- 
ference recommendation. This leads to a maximum transmitter power 
of 33 dbw or 2000 watts. 

If the antenna at Cornish were not beamed directly at Andover, the 
maximum transmitter power at the latter site could be raised by the 
antenna directivity loss thus obtained. For the 10-foot horn-reflector 
antenna, this amounts to over 40 db for 10 degrees of angle.® 

Furthermore, with a nonstationary satellite the probability of the 
earth station antenna being oriented at the interfered-with radio relay 
receiver, as assumed above, is quite low. This will reduce substantially 
the probability of the interference reaching 50,000 picowatts, and in 
turn will lead to a maximum transmitter power considerably greater 
than 2000 watts. 


7.2 West Paris—Andover Path 


The separation between the West Paris test transmitter and the 
Andover station was only 23.5 miles, and the median received signal 
with the antenna at the latter site elevated 7.5 degrees above the hori- 
zontal was about —112 dbm. With a standard TD-2 transmitting ar- 
rangement the power would have been about one db Jess or —113 dbm. 
This lacks 10 db of meeting the objective of —123 dbm developed above 
from the C.C.I.R. recommendation for mean interference in any hour. 
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This additional transmission loss could be provided by about 2 degrees 
of antenna discrimination® at West Paris. 

A TH receiver operating at West Paris potentially could suffer inter- 
ference from an earth station transmitter operating in the 6-kmc com- 
mon carrier band. The basic transmission loss in this band is estimated 
to be within about one db, as in the 4-kme band. Using the objective 
of —85 dbm developed above, and with the West Paris antenna 2 de- 
grees off beam, the permissible power at Andover, as far as interference 
at West Paris is concerned, could be about 31 dbw. 

No long-term data were obtained on this path from which one might 
predict the probability of occurrence of short-duration signal enhance- 
ments. However, it is quite possible that these might limit the maxi- 
mum tolerable power at Andover to a lower value than that derived on 
the basis of tolerable mean interference. 


VIII. ACKNOWLEDGMENTS 


The writer wishes to acknowledge his appreciation of the efforts of 
the Eastern Area of the Long Lines Department of the American Tele- 
phone and Telegraph Company in setting up the test transmitters at 
Cornish and West Paris. He is also indebted to D. KX. White of Bell Tele- 
phone Laboratories who was responsible for conducting the tests, col- 
lecting the data and assisting in the analysis of it; also to A. L. Durkee 
of Bell Telephone Laboratories for his estimates of the basic transmis- 
sion loss of the two paths and to J. N. Hines for his directivity patterns 
of the Andover antenna.’ 


REFERENCES 


1. Response of the A. T. & T. Co. to the F.C.C. “In the Matter of an Inquiry 
into the Allocation of Frequency Bands for Space Communications.” Docket 
No. 13522. 

. Curtis, H. E., Interference Between Satellite Communication Systems and 
Common Carrier Surface Systems, B.S.T.J., 41, May, 1962, p. 921. 

. Roetken, A. A., Smith, K. D., and Friis, R. W., The TD-2 Microwave Radio 
Relay System, B.S.T.J., 30, October, 1951, p. 1041. 

. Defined in Recommendation No. 241, [Xth Plenary Assembly of the C.C.LR., 
Los Angeles, 1959. 

. The TH Microwave Radio Relay System, B.S.T.J., 40, November, 1961, entire 
issue. 

. Curtis, H. E., Radio Frequency Interference Considerations in the TD-2 
Radio Relay System, B.S.T.J., 39, March, 1960, p. 369. 

. For a detailed description of the Andover, Maine, earth station receiver with 
its antenna, the reader is referred to the Telstar issue of the B.8S.T.J., 42, 
July, 1963, which appeared after the present paper was submitted. 


on i Or no) i A) 


The Nature of and System Inferences 
of Delay Distortion Due to Mode 
Conversion in Multimode 
Transmission Systems 


By S. E. MILLER 
(Manuscript received July 1, 1963) 


Quantitative estimates of delay distortion due to mode conversion in a 
multimode medium are made using an analysis based on modes coupled 
through power-transfer coefficients. This results in a simple translation 
from the spatial distribution of mode conversion to delay distortion without 
an intermediate step in the frequency domain. The expected value of the 
reconversion magnitude and its delay distribution relative to a driving 
impulse are found for (7) the case where the undesired mode loss is dis- 
tributed (helix damped modes and higher-order circular electric modes) 
and (it) the case where discrete mode filters are inserted (as in smooth- 
walled waveguide). Numerical estimates are given for TE in 2-inch I.D. 
guides at 55 kme. 

For both cases the power in the reconverston echo varies directly as the 
system length, and the shape of the echo 1s independent of length. For the 
case of distributed undesired mode loss the echo to tmpulse-excitation has 
an exponential shape in relative delay r, varying ase", and for the case 
of partially absorbing mode filters the echo is a line-segment approximation 
to an exponential in r (Fig. 4). The characteristic delay constant ro is about 
0.035 nanosecond for helix damped modes in an all-helix line, and is 
about 0.106 microsecond for TH: in etther helix or smooth-walled guide. 
For solid-walled guide with mode filters every 300 feet the characteristic 
delay constant (similar to ro) is about 2 nanoseconds. 

Estimates are made for signal interference effects from such echoes, 
taking account of the fact that the most limiting requirements on echoes in 
some system arrangements occur at r >> ro, where the reconversion power 
is small. For PCM in smooth-walled copper waveguide with mode filters 
every 300 or 150 feet, it 1s concluded that pulse rates of 200 or 400 megabits 
might be used, with up to 20 or 40 miles respectively between regenerators; 
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beat wavelength straightness variation mode conversion is controlling. For 
PCM in an all-helix waveguide, tt 1s concluded that a pulse rate up to 5000 
megabits and up to 746 miles between regenerators 1s permitted by mode 
conversion effects; diameter variations (T'Ko2 conversion) are controlling. 

For transmission of frequency division multiplex via a frequency-modu- 
lated carrier (FDM-FM), estimates based on the discrete-echo theory of 
Bennett, Curtis and Rice suggest that 4000-mile transmission of 2000 
channel groups 1s possible in all-helix waveguide; diameter variations are 
controlling. An rms frequency deviation of about 15 me (total band about 
150 mc) would provide 40 db interchannel interference ratio at 900 miles, 
and larger deviations (c) increase the allowed system length (2) according 
to 2 & (a)?8. Hven in solid-walled waveguide there is a good prospect for 
4000-mile FDM-FM using guide tolerances already achievable. 

Separate consideration is being given to delay distortion due to wave- 
guide cutoff dispersion, which will be appreciable in some configurations 
described and will require equalization. 


I. INTRODUCTION 


This paper gives some quantitative estimates of the delay distortion 
due to mode conversion to be expected in multimode transmission lines 
such as the millimeter-wave circular electric waveguides. In principle, 
the results apply to any multimode system, including optical guided- 
wave systems; in the latter case, however, it is likely that the magnitude 
of delay distortion will be too small to be a limitation in practice. In 
millimeter waveguide systems, delay distortion effects can be important. 

A very simple analytical approach is used, based on modes coupled 
through power-transfer coefficients. This permits a direct translation 
of the spatial distribution of mode conversion into the effect on the 
output waveform without recourse to the frequency domain as an inter- 
mediate step. The case treated herein is mode conversion that is an 
independent random function of distance, but the physical reasoning 
employed can be used to indicate the changes which alterations in the 
conversion distribution would cause. A perturbation method is employed, 
and limits are found for the distance for which this perturbation calcula- 
tion is valid. 

In a companion paper Dale T. Young! has shown how the power- 
transfer coefficients of the coupled-mode equations used herein are 
related to the more familiar coefficients between the amplitudes of 
coupled modes; this makes more quantitative the relation between the 
present work and the work of H. E. Rowe and W. D. Warters.? In 
another companion paper, L. H. Enloe’ presents a precise technique for 
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analyzing the delay distortion effects due to mode conversion, starting 
with the work of Rowe and Warters? and D. T. Young;! additionally, 
Enloe has extended the work of Bennett, Curtis, and Rice® to facilitate 
the calculation of interchannel interference in FM systems in the 
presence of the exponential echo which is characteristic of mode con- 
version and reconversion effects. 

Two different systems situations will be included in the numerical 
evaluations: (2) analog systems such as FM, wherein the important 
transmission line length is large, perhaps 4,000 miles, and (77) regenera- 
tive systems, PCM, wherein the pertinent line length is the distance 
between regenerators, perhaps 15-50 miles. 

Previous consideration has been given to FM on waveguide by 
Kazuhiro Miyauchi’ and R. Hamer.’ The present work seeks to add to 
our understanding by using impulse excitation and a simple physical 
model to clarify the nature of the distortion produced by mode con- 
version-reconversion effects and to make evident the controlling param- 
eters. By using experimental results in combination with analysis, it 
turns out to be possible to define in a broad way the system capabilities 
of existing waveguides without specifying many of the exact waveguide 
tolerances. Also presented is a first treatment of the delay distortion 
effects in smooth-walled waveguides with partially absorbing mode 
filters inserted periodically. 


II. PRELIMINARY FORMULATION OF THE PROBLEM 


To evaluate delay distortion we assume the transmission of an im- 
pulse in the desired signal mode. Power will be converted to an undesired 
mode and reconverted back to the signal mode after some relative delay 
or advancement. Use will be made of the three-mode system previously 
employed® and sketched in Fig. 1. We designate the signal power as 


Z = DISTANCE 


— 
PR (0)= IMPULSE ih Ne SIGNAL MODE 
— > 
ax 
4 (2) UNDESIRED MODE 


RECONVERTED-WAVE 
MODE 





P,, (0) =0 


Pn(o)=0 


Km 


Fig. 1 — Diagram representing mode conversion and reconversion in multi- 
mode transmission systems. 
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P;(z), which decays exponentially as ¢ “!*'"”’? due to heat loss ay, and 
conversion loss aq, coefficients. The undesired mode power P,(z) has 
a heat loss coefficient a,, and a conversion loss coefficient a,;. The third 
mode P,(z2) contains the reconverted power, and has a heat loss co- 
efficient a), and conversion loss coefficient az. In Ref. 1 free conversion 
and reconversion is permitted between mode x and mode n, while 
conversion out of P; is permitted but no reconversion back from mode x 
to mode 1 takes place. Thus the signal mode total power is (Pi + P.), 
but it is convenient to separate P; and P,, to identify the power which 
has undergone conversion at least once (Pn). 

In the present pulse analysis we will employ only one mode conversion- 
reconversion sequence and identify the limits of applicability of this 
approximation. 

We use power flow directly, as in the appendix of Ref. 8, and assume 
continuous random coupling between the modes. From this we get the 
expected value for the reconverted power under the conditions assumed; 
this is a useful first step, although more detailed knowledge would be 
desirable. 

The key to this analysts is the fact that the converted pulse in the x mode 
suffers simultaneous delay and attenuation relative to the signal mode. 
No matter where in the line a component is converted and reconverted, 
a relative delay 7 is always associated with a relative attenuation 
¢ “#I-4" where 4 is the distance the power travels in the x mode. 
Also, 


ies (1) 
Vz V1 
where v, and 2; are the group velocities in mode x and mode 1 respectively. 
The magnitude of the reconverted power density P,D relative to 
the signal pulse P,; is (for zg > a minimum distance to be specified ) 


P,D(1) = K exp [= (zh = ayn) 7] (2) 


where K is to be determined. By using (1) we can replace 7 in (2) by 
T, giving the reconverted power density versus 7 (Fig. 2). We see 
immediately that the echo power due to conversion and reconversion 
varies as ¢ /"°, where 7) depends only on the group velocities and the 
heat loss coefficients. Equation (2) can be integrated over all values of 
n, the distance travelled in the x mode, and equated to the average 
reconverted power P,, to evaluate K. 


[ P,D(n) dn = average power in mode n. (3) 
“0 


The next section establishes the right-hand side of (3). 
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P, 


TIME 


TIME 
a 


Fra, 2 — Signal impulse (P:1) and reconversion echo wave (Pn) versus time 
after transmission through a multimode line with mode conversion. 


III. DETERMINATION OF THE AVERAGE RECONVERTED POWER 


We use here the results of an earlier analysis.2 We assume continuous 
excitation of the signal mode at the sending end, and write the following 
equations for power flow between modes 


os = — (au, + az) Py 

Zz 

oP 

E:; 

“ = — (an, = az)P, + dnP:. 


In Ref. 8 these equations are solved for Pi(z = 0) = land P,(z = 0) = 
P,(2 = 0) = 0; dividing that work’s equation (21) by equation (19) 
we obtain 


= S {5 [-(a — 8) + Vi exp BI-(8 — @) + Vid 


bo] 


(4) 


eee eer en ec ee ee vial 


WIR 


where 
a= a,+ Ai 
B = Gn TP Get 
Vi= Va — BY? + daar - 
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It can be verified that P,/P, grows without limit for z — © because 
the exponent [—(8 — a) + ~/ is positive. This is an expected result 
due to the fact that conversion and reconversion take place between 
mode x and mode n, whereas power converts one way from mode 1 to 
mode x. 

We now seek to limit z in such a way that (4) is small compared to 
unity. We assume 


Aig = ag (5) 
a <K (Geh = ny (6) 
Then (4) reduces to 
sigan ere exp [a122/(dzn — ain)) + eee 
P; (dn — Qin)? 
F (7) 
exp [—(a@zn — du)z] — a= ao €xp [aie 2/ (Gen — an)] 
Now, by requiring 
(Gzn — Gin)z > 1 (8) 
we drop the third term of (7), and by letting 
2 
ee <I (9) 
(Qn — Qi) 


we can expand the second term of (7) (noting the fourth term is negligi- 
ble compared to the second), yielding 

Pi ir 2 

See 10 

Py (Gen ame Gin) ( ) 
Conditions (6) and (8) are not restrictive on the maximum length z 
of the system and are very typical. Condition (9) is restrictive on z, 
and by requiring it to be true, we discover P,/P; has the same form. 
Hence, so long as P,/P, calculated from (10) is small compared to 
unity, it is a valid calculation. Numerical values will be inserted at a 
later point. 


IV. JUSTIFICATION OF FIRST-ORDER PERTURBATION SOLUTION 


We now consider the validity of using only the first conversion and 
reconversion term. Using equations (19), (20) and (21) of the appendix 
to Ref. 8, it can be shown that when restriction (8) of this paper is met 
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it is also true that 


Pr + Pn _ (dsr — an) 
Pz tz 


(11) 


which is independent of z. We will see that this is typically a number of 
the order of 100 to 1000 or more. As stated above, when (9) is also 
true, P, «< Pi, and hence 


Py _ zh — Qin 


dae Az (12) 


This is believed to be important. P; decays simply as exp [— (ain + aiz)z] 
independently of all reconversion effects; P, therefore serves as a con- 
venient reference, and the other powers P, and P, are normalized with 
respect to it. For any 2 [subject to (8) and (9)] P,/P. is a constant. 
With increasing z, power is being converted continuously from P; to P, 
and from P, to P,, but P,/P; is constant despite the fact that P,/P, 
grows through orders of magnitude. It must follow that negligible power 
in mode x comes from mode n in the range for which (9) is valid. Since 
negligible power in mode x was ever in mode n, the second- and higher- 
order processes of conversion and reconversion can be ignored. 


V. RECONVERSION POWER DENSITY VERSUS RELATIVE DELAY 
Equation (3) can now be written [using (10)] 


2 
(Grn = ain) 


i K exp [— (azn — au)y] dyn = Py: (13) 


Performing the integration to evaluate K and substituting back into 
(2) gives 


P,D(n) = Pyar z exp [— (Gen — in) 7] (14) 
or, using (1) to eliminate 7 in (13) 
Pig. 1 1 
P,D(r) = 7 iN exp | — (aa — an)z/(t — ‘)| . (15) 
Vz V1 
In either case we have assured that 
ce) a) Piz 
P,,D(r) dr = P,,D(n)dy = ————. . (16) 
0 0 (Grn = in) 


It is convenient to define a time constant 7) at which the reconversion 
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power density is down to 1/e times the 7 = 0 value 


= 1 LY 5h (v. _ v1") 
7 (2 +) = 2c (Gzr — in) oe) 
(azn —_ in) 


where 

c = velocity of light in the dielectric in the waveguide 

vy = Vk,/xd 

d = waveguide diameter 

k, = Bessel root appropriate to the cutoff of mode y, and 

\ = intrinsic wavelength characteristic of the dielectric in the wave- 

guide. 

The part of (17) involving »’s instead of group velocities v assumes the 
waveguides are far enough from cutoff that 


a 1 
VHP aay: 


In some system applications it may be acceptable to ignore recon- 
version tails closer to the signal pulse than some time interval 7,. It 
is of interest then to sum the reconverted powers for 7 from 7, to «, 
which is the shaded area in Fig. 3. From (15) 


ve) 2 
Pidiz 2 
Py = PaD(r) dr = 2 exp (n/n. (18) 
Tr (zn = ain) 
PnD 
T—TIME DELAY RELATIVE TO \ 
THE SIGNAL PULSE \ 





Fig. 3 — Density of reeonverted wave power (P,D) versus time when there is 
coupling between modes with distributed differential loss. 
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VI. EFFECT OF MODE FILTERS ON DELAY DISTORTION 


The preceding paragraphs cover the case of distributed differential 
loss. When the undesired mode loss is lumped, as is the case when mode 
filters are used, (1) still holds, but we need a new expression to replace 
(2) to represent the density of reconverted power as a function of the 
distance 7 that the power traveled in the undesired mode. The appro- 
priate relation is 


Cin 
§ 


A | a(2 es ") aa oi? 2 i) | s<n<2s (19) 
- afoe(s—2) +.07(2-2)]as << 


in which s is the mode filter spacing, C; is the power transmission co- 
efficient for the mode filters, and the number of mode filter sections n 
must be large compared to (p + 1), where ps < 7 < (p + 1)s. The 
proportionality constant A is analogous to K of (2), and is to be eval- 
uated by equating the integral of P,D(7) over all 7 to the average 
reconverted power P,, . To do so we evaluate P, (assuming djz = Gz) 


P,D(n) afi-7+ Jtoro<a<s 


P, = i aizP(y) dy = n | azPAy) dy + (n — 1) 
2s 3s (20) 
/ a32C1P;,(s) dy + (n a 2) [ dizC 1 P,(s) dy + ences 


in which y is the distance in the direction of propagation, analogous to 
z of Fig. 1. We use a loose coupling approximation and assume random 
coupling effects as above. We can assume the loss in the driven mode 
(no. 1) is small in the length s between mode filters, so that we can 
write P,(y) in the interval ps < y < (p + 1)s, p any integer, 


(y—ps) 
Pilgy Pie) I Hap ACP Ry, (21) 


We evaluate (20) using the approximation that there is little difference 
in attenuation (other than the mode filter loss) between modes 1 and x 
in the length 7 which is ultimately important 


2 2 2 
Be Nas 8 [1 + 2a — Or , 2 — 2) 4 a, (22) 


n 
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For n > k, where C;" is the highest-order term of importance and z = 
ns = total length of line 


2 
diz 82 (1 + C4) 
2 acy va) 


Pe 





This expression is analogous to (10) and P; is again the power at z in 
the driven mode. We now evaluate A of (19) by equating the two 
expressions for P, 





2 fe} ore) 
Pi 5 aC) =) ; P,D(n) dn =f P,D(r) dr (24) 
where 
Ur 2 
s Ts (25) 


(26) 


ne 
Tt = 8(——— 
Vv V1 


my sfl 2 af 2 

mm c 5} Vy 5} V1 . 
Using (19) for P,D() we perform the integration of (24) and find, 
approximately 


A= (iz 2P : (27) 


We can now put (19) in final form 


P,D(r) = Pia,,2 1-24+%losrss, 
T 


ll 
x 
2 
8 
x 

VA ee OO ne | 
Q 
“pw 
| 
Pan a 
No 
ae 
Q 
IN 
| 74 
| 
—_ 
Wea 
Lo 


or more generally 


P,D(r) = Pye 2 | o: (: + 1- *) + Cr (: — t) | 
Ts Ts (28a) 
kr,g S 7S (kK +1)7; 
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and k = 0, 1, 2, 3, 4, etc. This function is plotted in Fig. 4. The recon- 
version power density declines linearly from its 7 = 0 value to C; times 
its r = 0 value at r = 7,. For tr; < +r < 27, the reconversion power 
density again declines linearly from Cir 2P to Cya4.72P, , and similarly 
for larger +. 

It is again of interest to evaluate the total reconversion power from 
tT = t, to r = © because echoes at very short time delays are not 
damaging. The ratio of reconversion power for 7 > 7, to the total 
reconversion power may be shown to be 


| P,»D(n) dy = Pra / P,»D(n) dn = CP (29) 
8 0 
and similarly for reconverted power at rt > 27, 


(ove) [o) 2s 
[ P,D(n) dyn = | P,D(y) dn — P,D(n) dn =CYP,. (30) 


s 


Thus for 7, = mr, the total reconverted power from 7 = 7, to r = © 
declines as Cy”. 


VII. DISTANCES FOR WHICH THE APPROXIMATIONS HOLD 


Two approximations concerning z have been made, relations (8) and 
(9). The minimum z comes from (8). We calculate for three waveguide 
cases of interest 





Fig. 4 — Density of reconverted wave power (P,D) versus time when there is 
coupling between modes with lumped differential loss. 
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(7) helix damped modes in a 2-inch I.D. all-helix waveguide, 

(iz) modes (such as TI.) which are attenuated only by the normal 
copper losses in the walls; this situation is the same in helix waveguide 
or in solid-walled copper waveguide, where again we use the 2-inch I.D. 
case, and 

(72) 2-inch I.D. solid-walled copper guide with a 15-foot helix mode 

filter every 300 feet. 
Condition (8) appears because we want to make exp [— (az, — ay)z] 
negligible compared to unity; we will use as a criterion for minimum 
z (Zmin) that (az, — Gin) 2min = 3. The table below gives the typical 
values for (az, — di) and the resulting 2min. For case (727), 2min >> ms, 
where m is chosen so that C1,” < 1; this assures that the line-segment 
exponential reconversion trail (Fig. 4) is fully developed. Except possibly 
for case (22) these restrictions are not very limiting. 

The maximum 2(Zmax) for which these estimates are valid comes from 
(9) and (10) or (28); we take P,/P; = 0.1 as the definition of 2max , 
although somewhat larger 2’s might give meaningful results. The values 
of (az, — Qn) from Table I (known from measurements on physical 
waveguides) are again used. The values of conversion coefficient a1; 
are known from measurements on experimental waveguides at the 
Holmdel, N. J., Bell Laboratories for cases (7) and (277) but are not 
known from measurement for case (22). For the latter the theory of H. 
I. Rowe and W. D. Warters’ was used, in combination with known data 
on the diameter variations of our good waveguides; a 1-mil rms diameter 
variation was taken, and the Warters-Rowe calculation for the average 
loss for random discrete imperfections on 10-foot joint spacing used to 
get the case (72) value of az in Table II. These are conservative values 
when applied to the 55-kme region; Holmdel waveguides have shown 
comparable conversion coefficients at higher frequencies. 

For all-helix waveguide, it appears that these estimates should be 
good for a cross-country analog system; for copper tubing with mode 
filters the estimates are good for 100 miles or thereabouts. 





TABLE I 
Waveguide Case Co Resi ay ieee fe /2* (@xh — ain) 2min 
(2) 1 db/foot 0.23/ft. 13 feet 
(it) 4.3 db/mile =1/mile 3 miles 
(tit) 3000 feet 





* With the necessary conversion factors to give the units shown. 
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TABLE II 
Average Amplitude 
Waveguide Case asada Loss a1z/(azh — ath) aig Zmax miles 
a1z/2* 
(i) 1 db/mile 1/5280 0.23/mile 2290 
(it) 0.005 db/mile 1/860 0.00115 per mile 74,600 
(202) 1 db/mile 0.23/mile 62 





* With the necessary conversion factors to give the units shown. 


VIII. NUMERICAL VALUES FOR RECONVERSION TIME CONSTANTS 


In this section we will evaluate the relations previously derived for 
the time constants of the reconverted powers, using parameters typical 
of millimeter-wave circular electric waveguides. 

It should be noted that the reconverted power will precede as well 
as lag the signal impulse. Significant mode conversion takes place be- 
tween TE, and TE due to straightness variations, and the group 
velocity for TE, is greater than that for TE . This will not be labored 
at greater length here; a typical group velocity difference will be calcu- 
lated, and part of the reconverted power for straightness variations will 
precede the signal and part will lag the signal. 

It should also be noted that the time variation of the reconversion 
power as given by (15) and (28) is independent of distance z within the 
limits ¢ S 2max calculated above. The magnitude of reconversion is of 
course dependent on z. 

Equations (17) and (26) show that less delay occurs when the velocity 
difference is smaller, and this is in turn obtained when » is smaller, i.e., 
the guide is farther from cutoff. From (17) or (26) the reconversion 
duration varies as 1/f’d’ where d = waveguide diameter and f = fre- 
quency. The estimates given here are for 2-inch I.D. guide at 55 kme. 

Consider the delay distribution of mode conversion due to random 
unintentional straightness variations, which causes coupling to the 
TMi, TEn , and TE, modes. Because TEi:2 gives the longest echo trail, 
we use it as the mode for the computation of (v, — ’)/2 and evaluate 
(17) and (26) as shown in Table III. 











TaBLe III 
Waveguide Case (vz2 — 12/2 Delay Evaluation 
(t) (all-helix) 0.0079 ro = 0.0348 X 1079 see 


(zit) (solid copper with mode filters) 0.0079 Ts 41 X 10°° sec 
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The other waveguide condition of interest, case (77), either helix or 
solid-walled copper with circular electric mode conversion, gives an 
exponential reconversion shape; (vy, — )/2 = 0.0197 (for TE, — 
TEo2), (@z. — din) corresponds to an attenuation difference of 4.3 
db/mile, and r> = 105.8 X 10°” second from (17). 


IX. SYSTEM EVALUATION OF MODE CONVERSION DELAY DISTORTION 


Some comments can be made on the relation between expected system 
requirements and reconversion power magnitude and delay distribu- 
tion, even though further analysis in the case of analog systems will be 
necessary to get precise numbers. 


9.1 PCM onan All-Helix Waveguide Line 


Consider first an all-helix waveguide line. Typical values of conversion 
coefficient a, and heat-loss difference (az. — din) have already been 
given in connection with Tables I and II. The first row of Table I applies 
for random straightness deviations, and the second row applies for 
diameter variations. If we require a total P,/P ratio (summed over all 
delay times) of —30db (as might be desirable in a PCM system of 
infinite pulse rate) we find from (18) with 7, = 0 that a distance z of 
22.9 miles is permissible as far as straightness variations are concerned, 
and z = 746 miles is permissible from the point of view of diameter 
variations. The shorter distance controls, and we can conclude for 
helix waveguide there is no limit on the pulse width permitted by re- 
conversion delay distortion in a PCM system with regenerators every 
20 miles. We can go further, however. For reconversion delays very 
short compared to the pulse rate there will be no interference between 
time slots; the reconverted power shows as a variation in apparent loss 
in the line and provided that this loss shift is stable, the reconverted 
power can be relatively large. Suppose we let z = 4000 miles; then the 
total P,/P,; = —7.6 db with respect to straightness variations, and the 
total P,/P,; = —22.6 db with respect to diameter variations. Using 
(18), we can compute the 7, for the total P,,/P, in the range 7, <7 < © 
to be —30 db; we find 7,/r) = 5.16 or r, = 0.18 X 10°” second with 
respect to straightness variations, and 7,/7> = 1.7 or r, = 180 X 10° 
second with respect to diameter variations. This implies that straightness 
variations (or other effect causing conversion to modes damped by the 
helix) do not limit and diameter variations do limit the length between 
regenerators in PCM systems with all-helix waveguide. By backing off 
from z = 4000 to z = 746 miles, the desired total P,/P; = —30 db 
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including reconverted power at all time delays; thus satisfactory PCM 
operation with regeneration every 746 miles is permitted by reconversion 
delay effects in an all-helix waveguide line for pulse rates up to about 
10°/0.18 or 5000 megabits/second. 


9.2 FDM-FM onan All-Helix Waveguide 


The work of Bennett, Curtis, and Rice’ allows us to estimate the 
limits of transmitting a single-sideband group of telephone channels by 
FM (FDM-FM) as is done on the TD-2 and TH radio relay systems. 
Their work shows that the RF signal-to-echo ratio need not be as large 
as the interchannel interference requirement, and since the length of the 
permitted system is inversely proportional to the required RF signal- 
to-echo power ratio (Ref. 5, equation 18) this can greatly change the 
permissible system length. The work of Bennett et al.’ (their Fig. 5.7) 
shows that when the ratio o/f,, (rms frequency deviation) / (highest 
baseband frequency), is unity or more the requirements on reconverted 
power are essentially independent of delay for r > 0.2/f, (fo = highest 
baseband frequency) and becomes very rapidly more tolerant of recon- 
verted power for smaller delays. At o/f, = 1 an FM advantage of about 
7 db for large time delays is given by Bennett et al., and for larger o/f, 
the FM advantage increases at the rate of 8.4 db/octave.* It seems 
reasonable to assume an FM advantage of at least 10 db, and more 
probably could be realized. This means that for interchannel interference 
of —40 db, Par/P1 = —30 db can be used, and since the requirement 
falls off rapidly below vf, = 0.2, an assumed 10-mc SSB baseband signal 
(2000 channels) allows us to use 7, = 20 X 10” seconds in computing 
the permissible system length from (18). The resulting estimate is 900 
miles, maximum system length for a 10-me group of SSB-AM channels 
to have —40 db interchannel interference ratio when sent with an FM 
index o/f, of about +/2 on an all-helix waveguide described by cases (2) 
and (27) of Tables I and II. (The limitation is TE.) Mr. L. H. Enloe has 
made a much more rigorous analysis and found a similar result.’ 

Since the FDM-FM application is limited only by conversion to higher 
circular electric modes, there would be considerable advantage in adding 
mode filters for such modes.° This is certainly possible in principle but 
has not been reduced to practice. Estimates of requirements for circular 
electric mode filters can be deduced from (23), (26) and (28). 


* This and the succeeding discussion mean that the FM advantage and the 
allowed system length vary as the 2.8 power of the frequency deviation when long- 
delay echoes are controlling. 
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9.3 PCM on Solid-Walled Copper Waveguide 


Turning now to PCM transmission in solid-walled copper waveguide 
with (noncircular electric) mode filters, the circular electric mode con- 
version limitations are the same as in an all-helix waveguide line. Typical 
conversion loss and heat loss numbers are given in case (272) of Tables I 
and II. For straightness variations (modes damped by the mode filters) 
and for P,/P; = —30 db including all time delays, the permissible 
distance between regenerators is 0.62 mile for 300-foot mode filter 
spacing; P,/P,, and hence permissible system length, varies inversely 
as the mode filter spacings. This form of line will not handle indefinitely 
large PCM pulse rates for 15-20-mile repeater spacings. For 300-foot 
mode filter spacing the length of reconversion pulse (7, in Fig. 4) is 
rs = 2.5 X 10°’ second (per Table III); a PCM system using a pulse 
rate of 200 megabits would probably be acceptable up to distances at 
which the ‘‘second” (7, < 7 < 27,) reconversion pulse becomes limiting. 
For 15-db mode filters,* this would be approximately a distance z such 
that 


or 2 & 20 miles between regenerators. For operation where the third 
(27, < 7 < 375) reconversion interval is controlling — i.e., at PCM rates 
of about 100 megabits — the length z can be 


10 log (z2/0.62) = 30 db 


or 2 = 620 milesT between regenerators. Alternatively, by using mode 
filters spaced 150 feet instead of 300 feet, the permissible pulse rate 
could be 400 megabits at 40 miles. We found above that diameter 
variations permit a length between regenerators (Pi/P, = —30 db) of 
746 miles, and hence this imperfection is not limiting for high-speed 
PCM in solid-walled guide with mode filters. 


9.4 FDM-FM on Smooth-Walled Copper Waveguide 


For the same guide with FDM-FM transmission where we may re- 
quire [in (23)] P,/P: = —380 db as above, the length z permitted by 
straightness variations is 0.62 mile (for all delay times considered inter- 
fering). If we ask what 7, would make the permitted system length 
controlled by straightness variations equal to 900 miles, the figure found 

* A 15-foot section of 2-inch I.D. Holmdel helix would have 15 db loss to TEi2 
and TE, at 55 kme and even more loss for TMi , the other significant mode. 


+ Here we have run beyond the 100-mile limit of applicability of the perturba- 
tion theory and the result needs further justification. 
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permissible due to diameter variations, we find [as indicated in the dis- 
cussion associated with (29) and (30)] 


0.62 

500 

Letting C, = 0.0316 (corresponding to a 15-db mode filter), we find 
7,/Ts © 2and 7, 5 X 10°” sec. Neglecting echoes with time delays 
less than 5 X 10° seconds is compatible with the assumptions made 
above for the /DM-FM transmission and appears to be a real possibility. 
However, to go to 900 miles system length is in violation of the zmax = 
62 miles computed earlier, and indeed it is clear that the perturbation 
calculation is no longer valid. We might inquire what would be the 
practical effect of longer systems than that for which the perturbation 
calculation holds.* When all of the power in the driven mode has been 
reconverted once, the driving signal would no longer be an impulse but 
might roughly be a waveform of the character sketched in Fig. 4. Then 
further conversion and reconversion sould broaden this waveform 
again. As a guess we might expect one perturbation-theory time-constant 
of delay distortion for every distance such that P,/P, ~ 1. For the case 
being considered, P,/P; 1 at z = 620 miles, and in this vicinity the 
above calculation showed a 7, of 5 X 10°” seconds would be adequate 
to meet requirements; if we now allow 7, = 20 X 10° seconds as done 
above (th, © 0.2) we might expect to tolerate (20/5) = 4 times 600 
miles system length. Thus, even in solid-walled guide with mode filters 
the reconversion power may be concentrated at such small delays as to 
permit FDM-I*M for 900 miles and possible greater distances. This must 
be regarded as a tentative estimate, a major uncertainty being the 
nature of the delay distortion in a multimode system in the region where 
several successive conversions and reconversions must be accounted for 
(i.e., perturbation theory not valid). 


Ci" — 


* Another interesting question concerns the way echo power will be related to a 
possible ‘breaking’? phenomenon at the FM demodulator. This is quite different 
from the breaking which occurs due to thermal noise. For echo delays which are 
short compared to the reciprocal of the modulating baseband frequency, the 
writer has concluded that no appreciable fluctuation in the envelope results, and 
the echo power has the principal effect of causing a static shift in carrier magnitude 
which would differ among members of an ensemble of waveguide lines but which 
would be approximately fixed (varying perhaps slowly due to temperature effects, 
drifts in carrier frequency, etc.) in any one line. For echo delays which are com- 
parable to the reciprocal of the modulating baseband frequency, envelope fluctua- 
tions do result and limiters can be used to advantage. Since some waveguide modes 
produce reconversion echoes which occur predominantly at delays short compared 
to the modulating periods, the total tolerable reconversion power may be signifi- 
cantly greater than the tolerable thermal noise at the FM demodulator. Further 
study of specific situations is needed to answer the question. 
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In practice there is another factor which will make solid-walled wave- 
guide less speculative for FDM-FM. As L. C. Tillotson has pointed out, 
one can demodulate the signal often enough so that the interference 
calculated from perturbation theory is valid. This is more than a ruse. 
By demodulating and filtering, the interchannel interference terms re- 
sulting from very short delay echoes are discarded. After remodulating, 
the short delay echoes are missing and hence they cannot go through 
successive steps of delay distortion to become longer delay echoes in 
the manner discussed above. Thus by demodulating at approximately 
the intervals at which the perturbation theory breaks down (100 miles 
approximately) one would filter out the short delay echoes and prevent 
their ever causing crosstalk. Since one usually drops and adds channels 
at intervals of 100-200 miles, it is possible that the demodulation pro- 
cedure would not be a serious limitation in many cases. 


xX. CONCLUSIONS AND DISCUSSION 


Using a simple power-flow analysis, expressions have been found for 
the magnitude and delay distribution for the expected value of the 
reconverted power in a multimode transmission system with mode 
conversion. Three cases are examined in detail: 

(t) the helix-damped modes in an all-helix line, 

(77) circular electric mode conversions in either helix or smooth- 
walled guide, and 

(z2t) the mode filter damped modes in a line made up of smooth- 

walled copper plus mode filters. 
For numerical examples, 2-inch I.D. guides at 55 kme are assumed. 
The minimum and maximum lengths of line (between terminals in an 
analog system or between regenerators in a PCM system) for which the 
analysis is valid are given in Tables I and II respectively. 

For impulse excitation, the reconversion echo as a function of delay + 
relative to the signal pulse varies as ¢”’”° as given in Fig. 3 for cases 
(2) and (22), and varies as a line segment approximation to an exponen- 
tial illustrated in Fig. 4 for case (7277). For mode conversion due to 
straightness variations (TE), ro in helix waveguide is about 0.035 
nanosecond and 7, (Fig. 4) in copper guide with mode filters spaced 
300 ft. is about 2.4 nanoseconds. For diameter variations (TEp:) in 
either type of guide 7 is about 106 nanoseconds; this could be radically 
reduced by the addition of circular electric mode filters, but system 
estimates herein do not count on that, since such mode filters have not 
yet been developed. 
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Only the echo power at delays greater than some relative delay 7, 
is subject to severe requirements in typical system layouts, and 7, > 79 
for certain imperfections in either helix waveguide or smooth-walled 
waveguide. The reconverted power P, , relative to signal power P; at 
the same distance 2, for r, < 7 < © is given in (18) for cases (7) and (72) 

Py, = [ PD) dr = Pits? oxy (—7 /t0) (18) 

o Tr ‘ (Qn Ts ain) é 

where diz is the average mode conversion coefficient and az, and a, are 
the heat loss coefficients in the undesired mode and signal modes respec- 
tively. Since reconverted power at delays less than some value 7, will 
usually have negligible system effect, (18) evaluates the important 
reconversion power. For case (227) similar results are given by (28), 
(29) and (30). 

For the transmission of PCM on an all-helix line, the discussion of 
Section 9.1 indicates that diameter variations limit the regenerator 
spacing to about 746 miles and random straightness variations limit 
the pulse rate to 5000 X 10° per second. For analog transmission an FM 
advantage on the echo interference as computed for a single echo by 
Bennett, Curtis and Rice® is essential to get reasonable requirements 
on the RF signal-to-echo ratio. (See Section 9.2.) Using the theory of 
Bennett et al., a rough estimate indicates that 2000 channels could be 
sent a distance of 900 miles using an rms frequency deviation of +~/2 
times the top baseband frequency, 10 mc; larger deviations would yield 
longer systems at a rate 6.9 times the system length for 2:1 increase in 
deviation. The length limit is again set by diameter variations. A more 
precise technique for computing the interchannel interference due to 
the exponential echo trails which are characteristic of multimode sys- 
tems, Figs. 2 and 3, is given by L. H. Enloe in a companion paper.® 

For the transmission of PCM on smooth-walled guide plus mode 
filters, the discussion of Section 9.3 indicates that pulse rates of 200 
or 400 megabits with mode filters spaced 300 feet or 150 feet might be 
used with distances up to 20 or 40 miles respectively between regenera- 
tors. Longer distances between regenerators are possible if the bit rate 
or the mode filter spacing is reduced. In Section 9.4, analog transmission 
of the same FDM-FM signal noted above (2000 channels) may be 
possible in smooth-walled guide with mode filters for the same system 
lengths and deviation ratio noted above. 

All of these estimates are very sensitive to the magnitude of the con- 
trolling imperfections in the medium; the permissible system length 
varies inversely as the fourth power of the geometric distortion magni- 
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tude. The echo duration estimates are also made for 2-inch guide at 
55 kme, and vary inversely as (frequency)? for fixed heat-loss coefh- 
cients. In practice, differential heat loss decreases with increasing fre- 
quency, tending to reduce the broad-range frequency variation in echo 
duration. 

Separate consideration is being given to delay distortion due to 
waveguide cutoff dispersion, which will be appreciable in some system 
configurations described and will require equalization. Equalization 
of the delay distortion due to mode conversion is also being considered 
separately. It is interesting to note that each delay difference 7 [of 
(28) or (15)] corresponds to a unique value of 7, the distance the energy 
travelled in the undesired mode, and that the phase difference between 
all reconverted components and the unconverted (or “straight-through’’) 
signal component is also unique for a given r. The random magnitudes 
(but not phase angles) of the conversion coefficients and the random 
location along the axis of propagation may make this surprising at first. 
There may be hope of equalizing out the expected value of the recon- 
version echo, but we must keep in mind that the variance of the echo 
magnitude will limit the degree to which this is possible. In this paper 
the system estimates are based on no equalization of the mode conver- 
sion effects. 

A constructive discussion of FM with L. C. Tillotson, C. L. Ruthroff, 
and L. H. Enloe is gratefully acknowledged. 
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Model for Relating Coupled Power 
Equations to Coupled Ampli- 


tude Equations 


By D. T. YOUNG 
(Manuscript received July 24, 1963) 


Following a suggestion of H. EH. Rowe, the first-order system of coupled 
differential equations with the independent variables representing amplitudes 
of coupled modes (the so-called coupled mode equations) has been rigorously 
transformed into similar equations with the independent variables repre- 
senting the power in the coupled modes for the case where the amplitude 
coupling coefficient 1s a Gaussian random process with a white noise spec- 
trum and zero mean value. Coupled mode equations in power have proven 
useful, and this work provides formal justification and identification of the 
variables and parameters. 

The equations may allow one to approach coupled-mode problems directly 
for statistical results. 


I. INTRODUCTION 
Consider the coupled line equations: 
Ty (2) = —Tolo(z) + je(z)h(z) (1) 
Ty'(z) = je(z)Io(z) — Tih(2). (2) 


These equations are useful in describing effects of coupling between 
a signal mode, represented by a complex wave amplitude [o(z), and a 
single spurious mode, represented by J;(z), caused by geometric imper- 
fections in a multimode transmission line. These equations may be de- 
rived in two ways from basic principles:'”""* direct conversion of Max- 
well’s equations to generalized telegraphist’s equations, or allowing 
discrete converters to become continuous. Exact solutions are known in 
only a few special cases, one of which is the case of constant c(z). Mak- 
ing use of this solution and assuming c(z) is a stationary Gaussian ran- 
dom function with a white noise spectrum So and zero mean value, it is 
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possible to derive coupled differential equations with the expected values 
of the power in Jo(z) and J;(z) as independent parameters. That is 


Py (2) = ( —2ap — So) Po(z) + SoPi(z) (3) 
Pi (z) = SoPo(z) + (—2a, — So) Pilz) (4) 
where 


1 1 1 
These equations are similar to equations used by S. I. Miller.® 


II. DESCRIPTION OF MATHEMATICAL MODEL AND COMPUTATION 
Consider (1) and (2) and assume the J)(z) are known and c(z) has 
1 


the constant value c in (z,z + Az); then J, o(2 + Az) are given by* 
ba + mi = (ettg | 
Iy(z + Az) I;(z) 

where G is a two-by-two matrix with elements 
gu = (2 +i) exp E (1 + ja)ae 

+ (‘ zit) exp E (1 - jae | 
go = (: = ‘) exp E (1 + jae 

+ (2 +i) exp BE (1 — jae | 
gi = gu = (5) exp E Chae jaz 

— (5) exp E (1 - jae | 


AT = Ty — Ty = Aw + jAg. 

















where 
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Now compute (Jo/9*) and (J,/,;*) from 


Tol o* ToIo* 
Ioly* a Toly* 
= (eT (G &% G*) 
ID o* ThI* 
hI* z+Az iT,* zZ 
where X denotes the Kronecker product (direct product) of the matrices 


G and G*.° If we compute ensemble averages, assuming the random 
function c(z) is independent of previous values and (c(z)) = 0, we have 


Pyo(z + Az) = €°**{APo(z) + BP,(z)] (5) 
Py(z + Az) = €°°“"IBPo(z) + DPi(z)] (6) 
where 


Po(z) = (ol2)Zo¥(2)) 


A = (gugu*) 
B= (gigi2*) 
D = (G20G22*) 


since (gugi2*) = (gigu*) = (912g2*) = (go2g2*) = 0, being odd functions 
of c. Expanding in powers (c’) and Az and requiring 


lim (c")Az = So 


<c2>+0 


and after considerable algebraic manipulation, which we omit, we obtain 
(3) and (4) 


Poe) - (— 2a = So) Pol) = 0g SoP1(z). 


These equations may be related to Miller’s equations by replacing our 
Po, Pi, 200, 2a1, and Sp by his P; + P,, Pz, din + Gan, G2,, and diz. 
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Interchannel Interference in FM Systems 
Produced by Continuous Random Mode 
Conversion in Circular Electric 


Waveguide 


By L. H. ENLOE 
(Manuscript received June 14, 1963) 


Long-distance, high-capacity transmission via the TE, mode in circular 
waveguide ts an attractive goal because the theoretical attenuation caused by 
heat loss decreases monotonically as the frequency of operation increases. 
However, continuous random mode conversion caused by the manufacturing 
or the laying of the waveguide causes severe random fluctuations with fre- 
quency tn the transfer function of present-day waveguide, a fact which 
apparently eliminates all but the “toughest”? modulation schemes, such as 
pulse code modulation and angle modulation. 

In this paper, we present the derivation of a technique for analyzing the 
effects of continuous random mode conversion on an angle-modulated wave. 
The random coupling coefficient is assumed to have a Gaussian probability 
density. The modulating signal is assumed to be a single-sideband frequency 
division multiplex which may be simulated by a band of random Gaussian 
notse. The quantity of interest ts the expected value of the interchannel 
interference notse which appears at baseband. A self-contained section on 
principal results and examples 1s presented. The examples use typical 
state-of-the-art data and are significant in that they demonstrate that FM 
appears to be an attractive modulation scheme. They are, however, not 
meant to be an exhaustive study of the problem. Copper waveguide (with 
and without mode filters) and helix waveguide are considered. 

The Fourier series, multiple-echo representation of a stochastic transfer 
function presented in this paper should prove useful in handling other 
modulation schemes as well as other random channels. 


I. INTRODUCTION 


Long-distance, high-capacity transmission via the TEo: mode in the 
millimeter-wave circular-electric waveguide is an attractive goal be- 
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cause the theoretical attenuation caused by heat loss decreases mono- 
tonically as the frequency of operation increases. However, in order to 
obtain sufficiently small heat loss attenuation and delay distortion, the 
operating frequency must be well above the TEo cutoff frequency and 
far into the multimode region. For example, a two-inch I.D. perfect 
copper circular guide will have a theoretical TE heat loss of 1.54 db 
per mile at 55 kme and will propagate 223 additional (spurious) modes.! 
Real waveguides are never perfect right-circular cylinders but have 
continuous random imperfections arising from the manufacturing or the 
laying of the waveguide. These random imperfections cause conversion 
of the TE signal mode to spurious modes.? We call the attenuation 
produced by this energy loss ‘‘conversion attenuation.”? Usually more 
important, the imperfections also cause converted energy in the spurious 
modes to be reconverted into the TI signal mode. Since the group 
velocities of the various modes will be different, the reconverted energy 
from each mode will be delayed or advanced in time with respect to the 
energy which has remained in the TE signal mode. Thus we have a 
signal plus a continuum of echoes. The effect causes the amplitude and 
phase of the TE wave to fluctuate randomly with frequency. A transfer 
function which is a random variable can be defined to take this fluctua- 
tion into account in a statistical sense. We call it the ‘reconversion 
transfer function.” 

The fluctuations in the reconversion transfer function distort any 
signal which might be transmitted through the waveguide. A “tough” 
modulation scheme is required in order to overcome this effect. In this 
paper we consider the use of large-index frequency modulation (FM). 
The recent work of Rowe and Warters? provides us with a sound and 
tractable mathematical analysis of continuous random mode conversion. 
We shall use the results of their work as our starting point. The base- 
band signal used to frequency or phase modulate the carrier wave is 
assumed to consist of a single-sidcband frequency division multiplex 
of telephone channels. For analysis purposes we can approximate this 
baseband signal by a band of random Gaussian noise. When such an 
angle-modulated wave is passed through a network whose attenuation 
and phase are nonlinear functions of frequency, interchannel interference 
appears in the demodulated signal. In way of explanation, imagine that 
the energy in the baseband modulating signal corresponding to some 
particular telephone channel is removed. When such a signal is impressed 
on a frequency modulator and the resulting wave is transmitted through 
the network in question, detected, and finally demodulated, the received 
output in the originally clear channel represents interchannel interfer- 


INTERCHANNEL INTERFERENCE 2767 


ence. Measurements of this type have been discussed by Albersheim 
and Schafer.’ 

To the author’s knowledge, there is only one tractable technique for 
determining interchannel interference in angle-modulated systems which 
does not require that the fluctuation rate of the attenuation and phase 
functions be slow over the bandwidth occupied by the signal. This 
exception is the method of equivalent echoes.>:* Since the phase and 
attenuation of the reconversion transfer function fluctuates rapidly 
across the bandwidths of interest, we are necessarily confined to this 
technique or variations thereof. We see that some sort of multiple-echo 
approximation to the reconversion transfer function is required if any 
accuracy in our results is to be expected. How many echoes are required, 
and how should they be distributed and weighted? The answer is clear 
when we recall that a small echo produces or is equivalent to a sinusoidal 
variation with frequency in the transfer function, i.e., the Fourier trans- 
form of the echo. We simply expand the reconversion transfer function 
in a Fourier series over a frequency interval which we know to be suffi- 
ciently large to contain all of the significant energy in the FM wave. The 
Fourier transform of this series is the desired train of echoes and is an 
exact representation over the bandwidth under consideration. The echoes 
are regularly spaced, and their amplitudes are proportional to the magni- 
tude of the coefficients of the Fourier series. The problem is now reduced 
to obtaining the Fourier series coefficients, and until recently this would 
have been a major stumbling block. Fortunately, however, Rowe, 
Warters, and Young have recently made important contributions?:7* to 
the understanding of continuous random mode conversion. They have 
demonstrated that over the moderate percentage bandwidths in which 
we are interested, the effects of random mode conversion give rise to a 
transfer function which is a random function of frequency much as a 
noise wave is a random function of time. Using their results, we can 
evaluate the statistics of the Fourier series coefficients, i.e., echo ampli- 
tudes, which are, of course, random variables. Moreover, if the cross- 
correlation between coefficients is negligible, we can determine the ex- 
pected value of the interchannel interference for each echo individually 
and then add the results to obtain the total expected value of the inter- 
channel interference. The random coupling coefficient is assumed to 
have a Gaussian probability density. The effects of delay distortion 
caused by waveguide cutoff dispersion are not included in this paper. 
The system is assumed to be equalized. 

Section II presents the principal results of this paper and contains 
examples demonstrating their use. The examples use typical state-of- 
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the-art data obtained from 8. E. Miller? and are significant in that they 
demonstrate that FM appears to be an attractive modulation scheme 
for use on the millimeter-wave circular-electric waveguide as far as 
continuous random mode conversion is concerned. Both helix waveguide 
and copper waveguide with mode filters are considered. The helix wave- 
guide has superior performance, of course, but copper waveguide with 
periodically placed mode filters may be useful in some situations. 
Section III presents the mathematical development of the results. 


II, PRINCIPAL RESULTS 


The present section contains a statement and discussion of the princi- 
pal results of this paper. We will restrict the discussion to the case of a 
single spurious mode and a single polarization. If other polarizations are 
present, they can be considered individually and the results added on a 
power basis.* The same is true for our purposes for different spurious 
modes. It is certainly true when the spurious modes arise because of 
different and uncorrelated physical imperfections in the waveguide. Tor 
instance, the TEo2 spurious mode is caused primarily by random diam- 
eter variations. Its reconversion transfer function would be uncorre- 
lated with that of the TE, spurious mode, which is caused primarily 
by random straightness variations, if the diameter variations are un- 
correlated with the straightness variations. Even when the spurious 
modes arise because of the same physical imperfections, it can be shownT 
that the cross-correlation in the same frequency interval is negligible. 

There are two cases of interest. In the first, the difference between the 
heat loss attenuation constant of the TEo: and the spurious mode, i.e., 
the differential attenuation constant AA, is uniform. An example would 
be the TEo, family on both the helix waveguide and copper waveguide 
with helix mode filters. The second case of interest is when the differen- 
tial attenuation is effectively zero, except at periodically located points 
along the waveguide where a total differential attenuation of k is in- 
troduced. An example of this type of behavior would be the TE, family 
on a waveguide having periodically placed helix mode filters. 


2.1 Uniform Differential Attenuation Constant 


Consider a length LZ of waveguide having a differential attenuation 
constant of AA for the spurious mode of interest. Assume that the ran- 
dom coupling coefficient between the TE, and spurious mode is Gauss- 


* See Ref. 2, Section 4.3.2. 
t See Ref. 2, Appendix G. 
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jan, pure real or pure imaginary, and has a white power spectrum of 
density S,. If the differential attenuation of the total length of wave- 
guide is much larger than one, ie., | AA | Z > 1, then the mean-square 
value of the equivalent echoes 7,2, normalized by the signal power, is 
given by the expression 


pe = "(1 / 20) (BelPoe ‘ae 
if P,/P) « 1, where 


P,/Po = reconversion echo-to-signal power ratio (2) 
= S/L/2| AA | (2) 
™ = | 3. — do” | /4c | AA | (3) 


and 


n = positive integer designating the nth echo. (The echoes lag the 
signal if i, > % and lead if i, > d,. Because the interchannel 
interference is independent of this, we use the absolute value | 0.’ — ay’ | 
in (3) and require n to be positive.) 

P, = reconversion echo power, i.e., the power in that portion of the 
signal which has been converted and reconverted once and only once. 

Po = signal power which has suffered no conversion. This will hence- 
forth be considered the signal power. 

do, 02 = mode cutoff factors* of the TE, and spurious modes at 
the band center frequency. This give rise to the differential phase shift. 

c = velocity of light in free space. 

T = 1/Af = time delay between successive echoes. 


Equation (2) has also been derived independently by Miller,’ using a 
different approach. Equation (1) was developed from a [ourier series 
expansion of the reconversion transfer function over the frequency 
interval Af. Hence, Af must satisfy certain conditions if meaningful 
answers are to be obtained. First, all of the significant energy in the 
angle-modulated signal must be contained within the bandwidth Af. 
Thus we require a bandwidth of 


Af 2 2(fo + 4c) (4) 


where f, is the baseband bandwidth and o is the root-mean-square fre- 
quency deviation of the angle-modulated wave. This is simply a state- 
ment of the familiar Carson bandwidth. Second, the mathematical 





* See Ref. 2, Appendix A. 
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derivation of (1) requires the differential phase constant to be approxi- 
mately proportional to frequency over the bandwidth used in the Fourier 
series expansion. This sets an upper limit on Af. In order to satisfy this 
requirement we require that the percentage bandwidth and the square 
of the mode cutoff factors of the modes involved be small compared to 
unity, i.e. 


Af/2fo K 1,39 K1 and 3, «1. (5) 


This will usually be true for the bandwidth and modes of interest, but 
it should always be checked as a safeguard. Third, the Fourier series 
coefficients must be uncorrelated with one another. It is well known that 
Fourier series coefficients become uncorrelated as the expansion interval 
approaches infinity. In a practical case, “infinity” is a number which 
is much larger than the correlation interval of the random variable in 
question. In the present case the autocorrelation function of the recon- 
version transfer function varies as 1/[1 + (2m7of’)"], where f’ is meas- 
ured about the band center frequency. Thus in order for the cross-cor- 
relation between coefficients to be negligible we require that 


(rroAf)? > 1. (6) 


Lastly, the expression (1) for 7,’ is limited by the accuracy of the per- 
turbation theory from which it was developed. It should be good, how- 
ever, as long as P,)/P: > 1 as calculated from (2). A plot of the equiva- 
lent echo train is shown in Fig. 1. We see that 7» is the ‘“‘time constant” 
associated with the envelope of the echo train, i.e., it is the value of 
time delay where the envelope of the echo train caused by reconversion 
has dropped to the relative value 1/e. 

If the cross-correlation between Fourier series coefficients, and hence 
echoes, is negligible, we may substitute the expression (1) directly 


r2 


go NS = eo T/ To) 
(1/7) (R./R) 





Fig. 1 — A plot of the reconversion echo train for uniform differential attenua- 
tion. 
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into the formulas given by Bennett et al.’ and sum the results to deter- 
mine the total baseband interchannel interference in the top channel. 

As an illustration, for a two-inch I.D. helix waveguide at 55 kme, the 
dominant spurious mode is the TE: , as far as reconversion echoes are 
concerned. Typical data are” 


| AA | = 4.3 db/mile = 0.495 neper/mile 
S, = 0.01 db/mile = 0.00115 neper/mile 
dy = 0.0172 <1 
i: = 0.058 K 1 
7 = 105.8 X 10 ‘sec. 


= 
| 


For a waveguide length of 4000 miles, we have from (2) 
P)o/P2 = 22.6 db > 1. 


Let us assume that a signal is applied to this waveguide which has been 
frequency modulated by a rectangular band of Gaussian noise. Let the 
baseband width be f, = 10 mc, and the rms frequency deviation be 
o = 2f, = 20 me. The significant energy in this wave would be contained 
in a bandwidth of Af = 100 = 20f, = 200 me. Thus, if we pick Af = 
200 me we also satisfy (5) and (6), i.e. 


Af/2fo = 0.002 K1, = (wm Af)” = 4,450 > 1. 


From Fig. 5.7 of Bennett et al.” we can read off the factor s(nfoT’), 
which multiplies the mean-square value of each echo to give the inter- 
channel interference-to-signal power ratio for that particular echo. 
s(nf,xT) takes into account the noise suppressing property of FM as 
well as the variation caused by the magnitude of the delay. A plot of 
s(nfoT) is shown in Fig. 2 for o/f, = 2 and 3. The total interchannel 
interference-to-signal power ratio is given by the sum 


ice] 


I/S = Di) s(nfT)r2 (7) 


n=1 


if P:/Po K 1. We see that since s(nf,7') is essentially constant at —15 
db for nf,T’ > 1/10, i.e., n > 1, we have to a good approximation 


I/S 


I 


s(nfoT’) a 


s(nfoT’) (P:/Po). 


(8) 


I 


In decibels 
I/S = —15 — 22.6 = —37.6 db. 
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Fig. 2— Weighting function s(nf7’) which multiplies individual echoes to 
give interchannel interference in top baseband channel for FM. 


This corresponds to 27.4 dba at zero level. Thus, other things neglected, 
continuous random mode conversion does not preclude a 4000-mile 
two-inch I.D. helix system. 

If this interchannel interference noise-to-signal ratio were too large, 
one could decrease it still further simply by increasing the modulation 
index. In other words, for the conditions of this example we are able to 
obtain FM improvement against reconversion echoes as well as additive 
thermal noise. However, from lig. 2 we see that for nf,J’ « 1/10 we 
would not be able to obtain this improvement. In fact, increasing the 
modulation index degrades the situation in that case. 


2.2 Mode Filters 


Mode filters may be placed at periodic intervals along the waveguide 
to provide large attenuation to the spurious modes. We assume here 
that each filter contributes k nepers of differential attenuation and that 
the waveguide between filters contributes essentially none. Let the 
spacing between mode filters be Zp and the length of each mode filter 
be negligible in comparison. In the previous section, for uniform differen- 
tial attenuation we found that the envelope of the equivalent echoes, 
when plotted versus delay, was an exponential. In the present case we 
find that the envelope consists of a piecewise approximation to an 
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exponential using straight line segments (see Iig. 3). The exponential 
is « * and the straight line segments connect the points defined by 
letting 2 = 0, 1, 2, --- , ete. Let us number the trapezoids represented 
by the area under each straight line segment according to the value of 7 
at the beginning of each trapezoid. The equation for the straight line 
in the ith trapezoid, i.e., between € *” and € **”* on the ordinate, is 


yi(r/n) = EL + [IL — Ei — (7/n) I] (9) 
where 7 is the length of the base of each trapezoid. The expression for 
the individual echoes is 
i oe TS. LLo —2ik 


ny 


1+ 1 — ei — (n7/n1)]I (10) 


TI 


where z is chosen from the inequality 7 S (n7T/7,) S i + 1. Here 
Ly is the length of waveguide between mode filters, L is the total length 
of waveguide and 7, = | 3, — ip | Lo/2c. All other quantities are as 
defined in the previous section. As in the previous section, we require 


that 
i <K1, OF K1, Af/2%foK1 and Af = 2(f,+ 40). (11) 


The requirement on Af to ensure negligible cross-correlation between 
the Fourier coefficients of the reconversion transfer function, i.e., be- 
tween individual echoes, is changed, however. The autocorrelation 
function is given by (42). The series converges quite rapidly for large 


ni 


= j cae) 





Fig. 3 — For periodically placed mode filters of k nepers differential attenua- 
tion each, the envelope of the equivalent echoes consists of a straight-line ap- 
proximation y; (7/71) to the exponential «?*. 
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differential attenuation per mode filter k. In most cases of interest only 
the first term or two is required. In order for Af to be much larger than 
the correlation interval we require 


(rr, Af/2)* > 1 (12) 


since| ¢ | Lo = fn . 

Equation (10) requires the number of mode filters N to be large, 
ie., N > 1, and that the square of the total differential loss of the N 
filters be large, ie, ©” « 1. Under these conditions, the reconversion 
echo-to-signal power ratio is given by 


P,/Py) = (LL)S8,/2) coth k (13) 


which must be small compared to unity in order for (10) to be accurate. 
As a check, notice that if k « 1, we have 


P./P) = LS. /2| MA | (14) 


where | AA | = k/Ly. This is precisely the expression presented in the 
previous section for a constant differential attenuation. 

It is frequently necessary to determine the ratio of the power in the 
ith trapezoid to the total. ‘rom simple geometric considerations this is 
given by 


P,,/P.= € [1 —é™ (15) 


Equations (13) and (15) have also been derived independently by S. E. 
Miller’ using a different approach. 

As an illustration, consider a two-inch I.D. copper waveguide system 
at 55 kme having 15-db mode filters every 200 feet. The TEn , TEx. , 
and TM), spurious modes would all contribute significantly because of 
random straightness deviations. The energy in all of these modes will 
be lumped together and considered as if it were in the TEy2. mode. This 
mode has the largest delay and consequently would produce the most 
interchannel interference. Thus we are assured that our calculations 
will be pessimistic. Numerical values were obtained from Miller” 


ce 
I 


15 db = 1.73 nepers 

In = 200 feet = 1/26.4 miles 

S. = 2 db/mile = 0.23 neper/mile 
i. = 0.0172 <1 

Dd, = 0.0334 <1 

16.08 XK 107?° see. 


ec 
a 
| ll 


4 
fe 
I 
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Equation (10) requires that the signal-to-echo ratio be large compared 
to unity. If we arbitrarily select Po/P, = 17 db, we find from (13) that 
the permissible length of waveguide is L = 18.7 miles. If we now de- 
modulate and pass the resultant signal through a baseband filter of 
bandwidth f, to obtain the FM advantage, we obtain a new baseband 
signal not unlike the original. Thus we can remodulate and proceed 
for another 18.7 miles without analytical complications, as long as the 
baseband signal-to-interchannel interference ratio remains large. It 
should be emphasized that placing a demodulate-remodulate repeater 
every 18.7 miles is done to keep out of analytical difficulties. We have 
not shown and are not implying that performance requires this crutch. 

As in the previous section, assume an I°M system with f, = 10 me and 
a/fy = 2. We choose Af to satisfy the inequalities (11) and (12). Pick- 
ing Af = 2 kmce yields Af/2fp = 0.018 « 1, Af > 2 (f, + 4c) = 200 me 
and (r7Af/2)” = 25 >> 1. In order to calculate interchannel interference, 
we might again multiply each echo by the factor s(nf,7') given by Fig. 
5.7 of Bennett et al.’ and sum the result. However, that paper gives the 
analytical expression 


s(r) = 2(a/fo)” (afor)* (16) 


which is valid for the small delays of this illustration, i.e., 27fyn7’ «< 1 
and (2xfjnT)” (o/f,)” K 1. We can integrate rather than sum. Thus in 
(10) we replace n7 by 7 and T by dr. For the interchannel interference- 
to-signal ratio contributed by the zth trapezoid, we have 


I,/S = | Peta) ek r 
T1 


{i+ 1 — e*)e — (r/n)]} ae 


= | Beta Beoh “nT1€ 


pyre le + GHA) A +0" 


—{6+7i(—e*)} 7]. 


It turns out, in this example, that the 7 = 0, 1 and 2 trapezoids all con- 
tribute significantly, with the 7 = 1 trapezoid producing the most inter- 
channel interference. In physical terms, the value of 7 tells us how many 
mode filter spacings the converted energy traveled in the spurious mode 
before reconverting. The zth trapezoid represents energy which traveled 
a distance larger than 7 mode filters spacings but less than 7 + 1 spac- 
ings. In this example the 7 = 1 trapezoid has substantially less power 
in it than the z = 0 trapezoid, but its larger delay more than makes up 
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for this difference (see Fig. 2). The total signal-to-interchannel inter- 
ference ratio in the top channel is S/J = 66.4 db. For a total system 
length of 4000 miles, we would need 4000/18.7 = 214 repeaters and 
would obtain a signal-to-interchannel interference ratio of 66.4 — 10 
logio 214 = 48.1 db if the only contribution were due to continuous 
random mode conversion. This corresponds to 22 dba at zero level. 

This is only part of the story. The TEo, spurious modes are not 
damped by helix mode filters. If it is conjectured® that the coupling 
coefficients, differential attenuation, etc., for these modes are the same 
for copper and helix waveguides, then the calculations and results of 
the previous section apply directly. The interchannel interference pro- 
duced by the TEo2 spurious mode predominates over that from all other 
modes. Consequently, it looks as though the performance of this copper 
waveguide with mode filters is comparable with that of the helix wave- 
guide. The imposed requirement of demodulate-remodulate repeaters, 
however, is certainly a disadvantage. One must determine whether or 
not they are really necessary before any definite comparison can be 
made. 


III. MATHEMATICAL DEVELOPMENT 


Our approach is to represent the effects of continuous random mode 
conversion by an infinite number of uncorrelated echoes. These echoes 
follow directly from the expansion of the reconversion transfer function 
in a I*ourier series over the frequency interval of interest. The coupled 
line equations serve as our starting point. They are 


dIo(z)/dz = —YVolo(z) + jce(z)I2(z) 
dl,(z)/dz = +je(z)Io(z) — T.L2(2) 


where 


I)(z), I:(2) = complex wave amplitude of the TEo. mode and spuri- 
ous mode, respectively, 

I), I. = deterministic propagation constants of the TE, and spuri- 
ous modes, respectively, 

c(z) = random coupling coefficient, assumed Gaussian and pure real 
or pure imaginary, and 

z = distance along the waveguide. 


If one imposes the initial conditions J)(0) = 1 and J,(0) = 0, then 
I)(z) is the transfer function of the waveguide. This may be factored 
into two components. One component, ¢ *”’, is deterministic and repre- 
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sents the attenuation and phase which would be present in the absence 
of random mode conversion. Since in theory it is completely predictable 
and equalizable, we shall not consider this component further. The other 
component is a random variable and represents the attenuation and 
phase which is contributed by random mode conversion. This is by defi- 
nition the reconversion transfer function and is given by 


Golz) see "I y(z). (19) 


Rowe’ shows that it may be approximated directly by use of Picard’s 
method of successive approximations, or with improved accuracy it 
may be obtained from the reconversion propagation function A(z) 
defined by the relation Gp(z) = € “”, where A(z) is approximated by 
Picard’s method. We use Rowe’s second approximation throughout this 
paper 


A(z) = i c(s)et*"™* ds [ c(t)e oT dt. 


The reconversion propagation function A(z) may be written as 
A(z, & 4A), where € = AG/2r = (Bo — Br)/247, AA = Ap — Az and 
To = Ao + 780, Ts = Az + 782. Notice that AA will always be negative 
since A, is positive and larger than Ay. A is nonstationary when con- 
sidered as a function of z. Under certain conditions, however, A is ap- 
proximately stationary, when considered as a function of £, over the 
small percentage bandwidths of interest.” The requirement is that the 
random coupling coefficient c(z) have a white power spectrum; that is, 
the autocorrelation function R.(u) = c(z)c(z + u) be an impulse func- 
tion or a close approximation thereof. The bar denotes an ensemble 
average. This seems to be true or at least approximately true for good 
waveguides over the moderate bandwidths required in our problem. 
Fortunately, it also is true that over the small percentage bandwidths 
in which we are interested, é is approximately proportional to frequency ;* 
that is, 


E=H+e =m(fr—-f) (20) 


where & and fy are measured at the band center, and £ = & + & and 
f=ftf.m = (di; — i) )/2c where % and 3, are the mode cutoff 
factors of the TE and spurious modes, respectively, evaluated at the 
band center frequency fp , and c is the velocity of light in free space. 

* From Ref. 2, Appendix A, — = (f/2c)[ fa — vo2)? — (1 — v,?)3] = (f/4e) (0.2 — 


Vo?) = constant/f if vo? <1 and ve? <1. Letting f = fo-+ f’ and — = & + & we 
have &’ + & = (constant/fo) (1 — f’/fo) = m (fo — ff’). 
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Thus we may determine the mean-square value of the coefficients of 
the Fourier series expansion of the reconversion propagation function <A, 
when A is considered as a function of frequency, as a limiting case of 
the power spectrum of A, when A is considered as a function of & The 
functional form of A depends upon whether é or f is the variable. For 
this reason we shall use a subscript 1 when é is the variable, and a sub- 
script 2 when f is the variable. 

Formulas for the power spectrum S,(u) of the random variable 
Ai(é) are developed for the case of AA = 0 by Rowe and Warters,” 
for AA = constant by Young,’ and for the case of periodically placed 
mode filters later in this paper. 


3.1 Transfer Function and Equivalent Echoes 


The complex reconversion propagation function A2(f) may be ex- 
panded in a Fourier series over the interval Af centered at fo 


+00 

BO) 5 ye (21) 
where Ao,(f) = Ac(f) for fo — Af/2 Sf S fo + Af/2. T = 1/Af and 
the subscript s reminds us that we have a periodic series approximation. 
If all of the significant energy in the I'M wave, or any other wave for 
that matter, lies within Af, then the periodic series (21) may be used 
for calculations. The n ~ 0 terms represent fluctuation components. 
If their sum is always small compared to unity with large probability, 
then the reconversion transfer function Go(f) = exp [—A2(f)] can be 
approximated by the series 


+00 ; ais | 
Go:(f) = exp | - a Gn aad = Pm 1 d, DS On ul (22) 


where the prime on the summation indicates that the n = 0 term is 
omitted. The impulse response of this periodic transfer function, i.e., 
its inverse Fourier transform, is 


+0 


hos(t) = 6% Eo — 2 a,8(t + nt) |. (23) 


Thus we have an undistorted signal plus a series of small echoes. 
The coefficients a, are given by 


Sots /2 
an = (1/Af) Mepe Or ag. (24) 
fo—As/2 
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Replacing f by f’ + fo we have 
+As/2 ose 
_ Cyperus f haf ogee /Af df’. (25) 
—Afs/2 


Now, over the small percentage bandwidths in which we are interested, 
the differential phase constant is proportional to frequency: i.e., & = & + 
& = m(fo — f’). Since Ao(f) = Ai(£), we may rewrite (25) as 


iar nfol Af ee ’ jane" /(—mAf) 7.0 
Qn = ce mnfo i. che yet Tn —m d 
(ay mAf | 2 Ante $0 : (26) 
a Pe Gs yarn re 
where b,, is the coefficient of the nth term of the Fourier series expansion 
of Ai(é’ + &) over the interval —Aé/2 S ¢’ S +Aé/2. For AE much 
larger than the correlation interval of A;(é), we may approximate the 
expected value of |b,’ |, and hence | a,’ |, in terms of the power spectrum 
of Ai(£). The result is 


| an? | = (L/ | m |) Ss,(—nP/m). (27) 


The imaginary part of the complex random variable A;(£) is the 
Hilbert transform?'* of the real part, so the power spectrum Sq,(w) 
vanishes for negative u. Thus, the coefficients a, are nonzero only when 
the sign of n opposes that of m. Negative n indicates that the echoes lag 
the signal and vice versa. Since the interchannel interference is inde- 
pendent of whether the echoes lead or lag, we may drop the minus sign 
in (27), use the absolute value of m, and restrict n to positive values. 





3.2 Uniform Differential Attenuation 


The power spectrum of the real part of the reconversion propagation 
constant A,(£) is given by Rowe and Warters,’ for the case of zero dif- 
ferential attenuation and a Gaussian random coupling coefficient, as 


S(u) = eee + SZ —|ull, [vl s 


0, otherwise. 


(28) 


Since the real and imaginary parts of Ai(&) are Hilbert transforms, the 
power spectrum of Ai(é) itself is given by* 


A) LS. S.[L —~ u], 0O-)<u<L 
ma ae 6(u) + SL —-uj, (0O-) Sus oe 


0, otherwise. 


* Discussions of the autocorrelation function and power spectrum of complex 
random variables, whose real and imaginary parts are Hilbert transforms, are 
given by Dugundji, 10 Zakai,!! and Deutsch.” 
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Young® demonstrates that if the differential attenuation is nonzero 
but is a constant equal to AA, the real and imaginary parts of A,(é) 
are again Hilbert transforms and its power spectrum is found by multi- 
plying that for AA = 0 by € 7!*4"!, Tf | AA | L > 1 we have 


(S7L7/4)5(u) + S2LeE**4" (0-) Su < 


Sa, (ee) = ; (30) 
0, otherwise. 
Equation (30) can also be obtained as a limiting case of the results of 
the next section. 
It is often convenient to work in terms of the reconversion echo-to- 
signal ratio P,/P», defined as the ratio of the power in the echoes in 
(23) to that in the signal. This is given by 


P./Py = Tat] = 2, (PY Lm) )Sai(n/| me) 


— [ G/Im{)SaiCr/ ml) dr (31) 


= 8/L/2|AA|. 


3.3 Mode Filters 


Consider a long section of waveguide of length L. Assume ideal mode 
filters are placed at Ip , 2L0, --- , NI = L. Let each mode filter be of 
zero length and produce k nepers of differential attenuation and a 
constant differential phase shift of @ radians, with no other side effects. 
If the differential attenuation of the length of waveguide Ly between 
each mode filter is negligible in comparison with that produced by a 
mode filter, we may approximate the differential propagation function 
of the waveguide by the relation 


AT (2) = j27& + (—k + 70) Y a(2 — nlp) (82) 


where 27 = AS = differential phase constant of the waveguide alone. 

In the coupled line equations (18) the propagation factors Ip and 
Tr, are constants. In the present case, however, they are functions of 
position along the waveguide, i.e., functions of z. Rowe has shown in 
unpublished work that in (18) we may replace 


Toz by yo(z) = Ty(S) dS 
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and 
rz by xe) = [ r.(s) as, 
0 


and use Picard’s method of successive approximations to obtain the 
approximate solution 


a(L) = f * ed at | * of S)e2™ gs (33) 


The derivation and result are exactly the same as Rowe’s published 
derivation’ for constant I’s with the exception of the substitution men- 
tioned. Substituting (32) into (83) we have 


A(N Ip) 


7 fr 0 ices Ee — (k — 76) = u(t — ne) | dt (34) 


f c(S) exp | ~i2888 + (k — 76) > u(S — ie) | ds, 


Li, OSxZ5 0m 


where u(x) = : otherwise. 


Simplifying and rearranging, we have 


N-1 n—1 (n+1) Lo 
A(NI») —_ oe >» coe c(t)—?™* di 
ne 


n=0 1=0 20 


(1+1) Lo : 
; i e(S)e- PF ds (35) 


Lo 


N-1 p(n+1)Lo , t ; 
+ SS / c(t)e”"* a | c(S)e Pres dS 
n nLo 


n=0 Lo 


which after more rearranging and simplification reduces to 


A(NIp) 
N-—1 n—-1 ate. Lo 
=> DS a eae c(L + nL) ec?" di 
n=0 1=0 0 
Lo . 
| c(S + Ily)e 7 dS (36) 


N-1 pLo t 
+> if c(t + nly) "* av | c(S + nly) ”"* dS. 
n=0 “0 0 
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If the random coupling coefficient c(z) has a white power spectrum, 
the reconversion propagation function A is an approximately stationary 
random variable” when considered as a function of the differential phase 
variable & We shall now calculate the autocorrelation function Ra,(£) 
of A. Since c(z) has a white power spectrum of density S,, it has an 
autocorrelation function which is an impulse, 1.e. 


R.(é) = S6(é). (37) 


Thus, each term in the series (86) is uncorrelated with all others. The 
autocorrelation function of the sum is the sum of the autocorrelation 
functions of each individual term. 

The autocorrelation function of a typical term 





j : c(t + nLo)e”™** dt (38) 
is 
= sin. a Lit 
RCE) = Sola (SB 2E). (39) 


In view of the fact that 1 < n in the double series of (36), the two 
integrals in each term are uncorrelated. The autocorrelation function 
of the double series is then 

N-—1 n—1 


N-1 
»y Da eo? (k—jrtLg) R’(é) i. > (N oe ie ee R’(é) (40) 
since we have N — 1,n —1 = 1 terms; N — 2,n —1 = 2 terms; --- 
1,n —1l= N — 1 terms. 

The autocorrelation function of each term in the single series of (36) 
is the same as the autocorrelation function for a single piece of waveguide 
of length Ly . Thus, for c(z) Gaussian, it is the inverse Fourier transform 
of the power spectrum given by (29), i.e., it is 


) 


1 (0) +B (RE) + 5R(E)] (41) 


where F is still defined by (89) and the circumflex stands for the Hilbert 
transform. Keeping in mind the fact that the de terms, i.e., (0) terms, 
add in phase, we conclude 


Ra,(&) = (N*/4)R°(0) + (N/2)IR(E) + JR(2)] 


N—1 : (42) 
ae Re) pS; (N = hye eo. 
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In order to find the reconversion echo-to-signal ratio, we could inte- 
grate over the power spectrum of A;(é) as in (81). However, the al- 
ternative method of letting ¢ = O in the autocorrelation function is 
easier. Note that the 0+ limit on the integral in (31) corresponds to 
dropping the constant term, i.e., the (3)R’(0) term in (42). Thus 
the reconversion echo-to-signal ratio becomes 

N=1 


P,/Py = (N/2)R'(0) + > (N — n)e?™ (0). 


r= 


If N > 1and €*”* <1, then 


Lins: 


P,/Po = coth k (43) 





where L = NIy. 
The power spectrum of A;(&) is the inverse Fourier transform of 
(42) and is given by 
Sa,(u) = (4)N*LeS.6(u) 
not ' (44) 
+ S2 D7 (N — t)eP*((Lo — | u — Lo |)) 
1=0 


for u 2 (O—), where 


zifx>0 
((z7)) =4 
Oife <0. 


If N > 1 and ¢ ** <1, then the terms diminish quite rapidly. Thus, 
for 1 < N we have the simpler expression 
Sa,(u) = (4)N*Ly' S.6(u) 

+ NIpS.e*™ [1 + (1 — €*)(i — (u/Lo))] 
where (—0) $7 S u/Iy S i+ 1. A rough sketch of this function is 
shown in Fig. 3. The sketch consists of an impulse at the origin plus a 


straight-line approximation to an exponential. 
It is often convenient to have an expression for the ratio of the re- 


(45) 


conversion echoes under the 7th straight line, i.e., from u = iLp to 
u= (4+ 1), to the total. This is given by 
PifPe= (PafPo) / (PAP) = Ae Ye. (46) 


If we allow the differential attenuation k per mode filter and the spac- 
ing Ly between mode filters both to decrease without limit such that 
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their ratio remains finite, (43) reduces to (31), and (45) reduces to 
(30), where | AA | = k/Iy. 


3.4 Interchannel Interference 


Let the angle-modulated wave 
e;(t) = iol tie(s) (47) 
be applied to a filter having an impulse response given by (23), i.e., by 


-foo 
host) = €*[3(t) — D0’ a,d(t + nP)], (48) 


g(t) is assumed to represent stationary random Gaussian noise. The 
output wave is found by convolution and is 


Cc 
_ jogttje(t i t T p(t T. 
€o(t) = ¢ may eiae +je(t) ou ane? +nT)+jo(itn 1. (49) 
n=—o 
= nse 
From (26) we have a, = €7"° by, = &"™ 7""7r, | where rn and 6p 


are real. If there are a large number of sample points’ contained in the 
small percentage bandwidths of interest, then to a very good approxi- 
mation the @,’s will be uniformly distributed in the interval (0, 27) 
and the b,’s will be uncorrelated. Substituting, (49) reduces to 


-+oo 
€g(t) = ea teat ae) = 3: Tn Pad a a AC (50) 


n=—0O 


The output consists of the original signal plus an infinite number of 
small, uncorrelated echoes. If the total power contained in the echoes 
is much smaller than that contained in the signal, we may follow a 
procedure identical to that of Bennett et al.” to determine the baseband 
interchannel interference in the top channel as a function of time which 
results from each of the above echoes. We then determine the autocor- 
relation function of the interference and average over 7, and 6, . Since 
the @,’s are uniformly distributed, the interference caused by each 
echo adds on a power or mean-square basis. In their work, Bennett 
et al. carried along a deterministic term p7’ which would be replaced by 
6, in our case. This term appears in the final result as cos 6, , which has 
an average value of zero. The curves of Fig. 5.7 of Ref. 5 are for 
cos pl’ = O and so apply to our case directly. 
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IV. CONCLUSIONS 


We have presented an analysis of the effects of continuous random 
mode conversion on an angle modulated wave. The modulating signal 
was assumed to be a band of random Gaussian noise which is used to 
simulate a single sideband frequency division multiplex. A technique 
was presented for determining the expected value of the interchannel 
interference noise which appears in the top channel at baseband. Ex- 
amples using typical state-of-the-art data demonstrated that FM ap- 
pears to be an attractive modulation scheme. Copper waveguide, with 
and without mode filters, and helix waveguide were considered. 

It should be pointed out that the techniques and the results presented 
here should be applicable to other multimode and multipath situations 
virtually unchanged. The Fourier series characterization of the trans- 
mission channel should also be of use in the study of other modulation 
schemes. . 

Our work contains two key, but reasonable, assumptions which one 
must keep in mind: 

(t) The perturbation technique upon which our work is based re- 
quires that the reconverted energy consist primarily of energy which 
has been converted and reconverted once and only once.?” 

(tz) The random coupling coefficient is assumed to be Gaussian and to 
have a power spectrum which is white, or uniform, over a bandwidth 
which is large compared to the bandwidth “occupied” by our signal. 
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Iiffect of Differential Loss on Approx- 
imate Solutions to the Coupled 
Line Equations 


By D. T. YOUNG 
(Manuscript received July 24, 1963) 


The assumption of zero differential loss between coupling modes in a 
multimode transmission line reduces the complexity of theoretical analysis. 

Here we show that in general the approximate solution including differ- 
ential loss between modes may be computed by convolving the solution for 
the case of zero differential loss with the Fourier transform of exp (— | Aeu |). 

The rapidity of loss variation versus frequency ts limited to (2Aa/A8)f 
for transmission lines with high Aa. 


I. INTRODUCTION 
Consider the coupled line equations 
Io'(z) = —Volo(z) + je(z)h(z) (1) 
Th'(2) = je(z)Io(2z) — Tih (2). (2) 


‘These equations are useful in describing the effects of coupling between 
a signal mode, represented by a complex wave amplitude Jy), and a 
single spurious mode, represented by J; , caused by geometric imperfec- 
tions in a multimode transmission line. These equations may be derived 
in two ways from basic principles. The coupled line’ or generalized 
telegrapher’s equations’” may be derived directly, or the geometric 
imperfections may be considered discrete; the case of continuous im- 
perfections can then be considered as a limiting form of the discrete 
case." 

Exact solutions for these equations are known in only a few special 
cases, so considerable attention has been given to approximate solu- 
tions.’ A second-order approximate solution is difficult to examine in 
general; however, Rowe and Warters’” have given a very thorough in- 
vestigation for the case of equal attenuation for the two modes or zero 
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differential loss (Re [xo — T;] = 0). Rowe’® has shown in the case of a 
random coupling that the average loss for the TE in a circular guide 
mode may be calculated as the convolution of the Fourier transforms 
of three functions: an attenuation function, a triangular function, and 
the covariance function of the coupling. Here we show that in general 
the approximate solution for loss and phase of the TE); mode may be 
calculated by convolving the solution for zero differential loss with the 
Fourier transform of the attenuation function. 


II. PROOF 


Approximate solutions to (1) and (2) are more conveniently described 
by normalizing the mode amplitudes in the following way 


Go(z) = In(z)e**” 
Gi(z) = Iy(z)e™™. 
The approximate solution’ for G(z) is 
Go(z) =1—p 


where 


p= i en au | c(a)e(a + u) dx 
0 0 
AT = An+jap=%—-Ty 
and the initial conditions are 
I,(0) = G0) = 1 
I,(0) = G,(0) = 0. 


I 


The normalized loss, A = —In | Go|, may be further approximated by 
A = Rep 
and the phase 6 by 


6 = —Im p. 


2.1 Loss 


Consider first the case of real coupling c(z); then the normalized loss 
for a guide of length L is 


L L-—u 
A= 7 e“* cos ABu au | c(a)e(a + wu) da. 
0 0 
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Let 
é(x) = p(x)c(x) 
where 
p(w)=1 O8S828EL 
= 0 otherwise. 


Then we write 

R(u) =f elede(a + w) de 
F 

Ru) = [ e(x)ea + u) ae. 


We observe that (uw) so defined is an even function of uw and vanishes 
for |u| > L. 

R(u) = R(—-u) 

R(u) = 0, El ease 0 


Assuming the signal mode is the lowest-loss mode of the transmission 
line, such as the TEo: mode in circular wave-guide, we will have Aa S 0 
and may write the normalized loss as 


A(t) = a e 4%! Pw) cos 2atu du 
or 
A(¢) = 5 [. e'*! Riu) 6 7 du 


where 
276 = AB. 


Thus A(¢) is the Fourier transform of the product of two functions and 
may be written as the convolution of their individual transforms. The 
transform of the first function is 


B(s) = i glean Pi ici du 
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The transform of the second function is the solution for Aa = 0, 


Aas) = [ 2 ot ay 


= $/C(s) P 
where 
C(t) = [. e(u) 6 2?" du. 
Thus 
A(f) = B(g) * Ad(f) 
where 


B(s) « Aolt) = [ B(g)Ad(t — 8) a 


We observe that for Aa = 0, B(¢) becomes the unit impulse function 
which is the identity function for the convolution operator 


Ao(f) = Bs) * Ad(s), Aa = 0. 


2.2 Phase 


The same analysis may be applied to the normalized phase as follows 


L L—u 
(¢) = a, e°™ sin Apu du | c(x)ce(x + u) dx 
0 0 


| e“™ R(u) sin ABu du 
0 


= -5 e'*l R(u) sgn u sin ABu du 
where 
ssnu = +1, u>oO 
=-2a1, u<0O 
=i go taau sy sen uo 2% dy 
then 


a(t) = —B(r) * Ad(t) = 
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or 
ag) = ~A(t) += 


which means the loss and phase functions are related by Hilbert trans- 
forms.’ 

The extension to complex coupling is straightforward and we obtain 
the following results: 


Let 
e(x) = [C, + jCi]e(x) 
then 
2 2 1 
A(t) = BG) «| (G3? — CF)AWlt) + C2C,C:Aa(s) «| 
6(¢) = —B(f£) « | (20.01) dole) + (C7? — CF )Ag(S) “4 
and 
a(t) = —A(t) “s 
since 
or 
ae are = 6(¢) 


where 6(¢) is the unit impulse function. 


III. DISCUSSION AND INTERPRETATION OF RESULTS 


For simplicity we discuss only real coupling, but the results apply 
equally well to complex coupling. Consider the representation of c(2x) 
by a Fourier series over the length L with coefficients c,. Rowe and 
Warters* have shown that for the case of zero differential loss, Ao(¢) 
may be expressed as a double infinite summation 


Zs CmCn*(—1)"— 


ioe] 


n sin r(¢L — m) sin r((L — n) 
a(¢L — m)r(¢L — n) 


which is an expression for loss in terms of free-space wavelength, since 
at frequencies considerably above cutoff 2r¢ = Ddo(D constant). 

Ao(¢) is a band-limited function. Its sample points have relative fre- 
quency separation 6f/f = 6¢/f = 1/2¢L, so the signal mode loss may 
vary more and more rapidly with frequency as the line length increases, 


Af) = 4 


n=— 
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for Aa = 0. Since Ao(¢) is a band-limited function, A(¢) is a band- 
limited function also. If we consider ¢ as a time variable, then A(¢) is 
the result of filtering Ao(¢) with a low-pass filter whose impulse response 
is B(¢). If Aa is very small B(¢) has large amplitude at the origin and 
approaches zero rapidly away from the origin, and convolving B(¢) 
with Ao(f¢) results in little change. The most noticeable change is a 
reduction in the peaks and an increase at the original minimums of the 
loss fluctuations. However, if Aa is increased until the spacing of the 
half-height points of B(¢) is wider than the sample point spacing of 
Ao(f), the rapidity of the loss fluctuations with frequency will be con- 
trolled by B(¢). In general for high Aa the frequency separation between 
half height points for A(¢) is approximately the same as that of B(¢) 
which is independent of length 


of - (2) 
f Ag} 
A numerical example pertaining to the waveguide problem is the 


following: Consider the previously mentioned loss function for a guide 
with a two-foot wiggle. The signal mode is TE . Let 


L = 1000 ft. 
k; = 500 
Aa = —0.184 neper/ft. 


I 


Now the peak loss point is for ¢ = 1/2 which is near 50 kme for TH, in 
2-inch diameter circular copper waveguide. The Aa value is typical of 
TE in lossy-jacketed helix waveguide. Now consider the half-height 
points for Ay. This bandwidth is approximately 0.084 kmc, which is 
very narrow compared to B(¢), so that after convolution A(¢)  KB(¢). 
The half-height points for B(¢) are about 6.0 kmc. Thus the addition of 
the differential loss changes the TE; loss from a very rapidly varying to 
a very slowly varying function of frequency. This effect is of great value 
for wideband transmission systems. It is also important in experimental 
measurements, since the number of measurements necessary for a guide 
with high loss to the spurious mode is greatly reduced. 
Finally we recall that (2) = p(x)c(a) so that 


C(s) = (Le ica =i) * C(£) 
wht 
which makes the solution for a Fourier series representation of c() 
obvious, since C'(¢) would be a series of impulse functions, and the con- 
volution operation is very easy. 
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2.3 —jrtL sin ale +jartL sin Lg 
A(S) = $865) «| (Ze*™* SBA (yy) (nett BAL on(2)) | 


Tr +00 +00 


m=—0 n=—00 


sin r({L — m) sin r(¢L — ™)| 
r(¢L — m)r(fL — n) ; 
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Markov Processes Representing Traffic 
in Connecting Networks 


By V. E. BENES 
(Manuscript received June 27, 1963) 


A class of Markov stochastic processes x; , suitable as models for random 
traffic in connecting networks with blocked calls cleared, 1s described and 
analyzed. These models take into account the structure of the connecting 
network, the set S of its permitted states, the random epochs at which new 
calls are attempted and calls in progress are ended, and the method used for 
routing calls. 

The probability of blocking, or the fraction of blocked attempts, is defined 
in a rigorous way as the stochastic limit of a ratio of counter readings, and 
a formula for it ts given in terms of the stattonary probability vector p of 
x,. This formula ts 








(8) SP 
(D, a) > PrQx 


zeS 


where B, 1s the number of blocked idle inlet-outlet pairs in state x, and 
az 1s the number of idle inlet-outlet pairs in state x. On the basis of this 
formula, ut 1s shown that in some cases a simple algebraic relationship 
exists between the blocking probability b, the traffic parameter d (the calling 
rate per idle inlet-outlet pair), the mean m of the load carried, and the 
variance o° of the load carried. For a one-sided connecting network of T 
inlets (= outlets), this relation is 


1 2m 


be! = (pam s (on ae 


for a two-sided network with N inlets on one side and M outlets on the other, 
at as 


ieee Ae 
(N — m)(M — m) + 0° 
2795 
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The problem of calculating the vector p of stationary state probabilities 
as fully resolved in principle by three explicit formulas for the components 
of p: a determinant formula, a sum of products along paths on S, and an 
expansion in a power series around any point \ = 0. The formulas all 
indicate how these state probabilities depend on the structure of the connect- 
ing network, the traffic parameter \, and the method of routing. 
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I. INTRODUCTION 


A connecting system is a physical communication system consisting of 
(<) a set of terminals, (22) a control unit which processes the information 
needed to set up calls, and (272) a connecting network through which 
calls are switched between terminals. . 

Connecting systems have been described heuristically and at length 
in a previous paper.! Also, some of the algebraic and topological prop- 
erties of connecting networks have been studied in another paper.? 
The models to be used here have been described (but not studied) in 
a third paper.’ These papers are a source of background material for 
reading the present one; familiarity with them is desirable, but is not 
presupposed. 

The principal problem treated here is the exact theoretical calculation 
of the grade of service (as measured by the probability of blocking) 
of a connecting network of given but arbitrary structure; the calcula- 
tion is to be carried out in terms of a mathematical model for the opera- 
tion of the network. The model used here is a Markov stochastic process 
x, defined by some simple probabilistic and operational assumptions. 
The problem is first reduced to calculation of the stationary probability 
vector p of x; from the “statistical equilibrium” equations. From the 
form of this reduction it follows that in many cases of practical interest 
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the probability of blocking is uniquely determined by the mean and 
variance of the carried load, a fact heretofore known only for very simple 
systems. 

In the past, the application of A. K. Erlang’s very natural method of 
statistical equilibrium has been visited by a curse of dimensionality, 
that is, by the extremely large number of equations comprised in the 
equilibrium condition. This difficulty has not only put explicit solutions 
apparently out of the question; it has even made it effectively impossible 
to reach a reliable qualitative idea of the dependence of the blocking 
probability on the structure of the network, the method of routing, etc. 

Three explicit formulas for the solution p of the equilibrium equations 
will be given. One is based on purely algebraic considerations, and the 
others largely on combinatory and probabilistic notions. Because of the 
generality of the model with respect to network structure, these formulas 
are of necessity rather complex. Except in simple cases, they cannot be 
regarded as giving a final (or even a working) solution to the problem 
of calculating equilibrium probabilities. Still, they expose the mathe- 
matical character of the problem, and provide a badly needed starting 
point for well grounded approximations. For only after one has studied 
and understood this character can he seriously consider ignoring some 
of it in approximations. 


II, PRELIMINARY REMARKS AND DEFINITIONS 


Various combinatory, algebraic, and topological features of the con- 
necting network play important roles in the analysis of stochastic 
models for network operation. Some of these features will now be de- 
scribed, and terminology and notations for them introduced. 

Let S be the set of permitted (i.e., physically meaningful) states of 
the connecting network under study. It has been pointed out in earlier 
work! ? that these states are partially ordered by inclusion S, where 


xsy 


means that state « can be obtained from state y by removing zero or 
more calls. Also, these states can be arranged (in fact, partitioned) in an 
intuitive manner in a state-diagram, the Hasse figure for the partial 
ordering S. This figure is a graph constructed by partitioning the states 
in horizontal rows according to the number of calls in progress, the kth 
row consisting of all states with k calls in progress. The unique zero, 
or empty, state of the network, in which no calls are in progress, is 
placed at the bottom of the figure; above it comes the row consisting 
of states with exactly one call in progress, and so on. The figure is com- 
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pleted by drawing a graph with the states as nodes, and with adjacency 
matrix determined by the condition that states differing by exactly 
one call are adjacent. This means that in drawing the graph we place 
lines between states (in successive rows) that differ in point of one call. 
A maximal state is one that forms a summit of the state-diagram, i.e., 
has no states above it in the partial ordering. 

For most systems the state-diagram has the following heuristic de- 
scription: there is a unique “point” at the bottom corresponding to the 
zero state; there are usually many ‘“‘points”’ at the top corresponding 
to the maximal states; and the diagram is very ‘fat’ in the middle, 
because of the multitude of states with a moderate number of calls in 
progress. 

We mention at this point that usually the number of states, i.e., the 
number of elements of S, is astronomically large. Indeed, this fact has 
been a principal obstacle to theoretical progress on problems of con- 
gestion in large connecting systems. For an illustration, in the network 
of No. 5 Crossbar type, illustrated in Fig. 1, made for 1000 lines out of 
square 10 X 10 switches, the number of maximal states alone is 


(100 x 10!)* = 1.734 x 10%. 


The set of inlets of a connecting network is denoted by J, and the set 


CROSSBAR 
~~~ SWITCH 






Gust GNERS: ; . TRUNKS 


eee Ne el [ie et eh Bae 
LINE LINK FRAMES TRUNK LINK FRAMES 


Fig. 1 — Structure of No. 5 Crossbar network. 
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of outlets by Q. It is possible that 1NQ = ¢, that 1NQ # 4, or even that 
I = Q, 1.e., that all inlets are also outlets, depending on the “community 
of interest”’ aspects of the structure of the network. It is assumed that 
every call or connection is made only between an inlet and an outlet. 

If x is a state, the notation | «| (read “the norm of x’’) will denote 
the number of calls in progress in state v. If X is a set, then | 2 | will 
denote the cardinality of X, i.e., the number of elements of X. We 
define the levels 


Ly = {x e S: | ae | de}, k = 0,1, ---, max |2|, 
zeS 
as the sets of states in which a specified number of calls is in progress. 
The {Z,} form a partition of S, 


Uk, =S8 
k 


LN L; = 0, k A 7. 


The ‘“‘neighbors”’ of a state x are just those states which can be reached 
from x by adding or removing one call. These neighbors y of x can be 
divided into two sets according as y > x or y < x; so we are led to 
define 


A; = set of neighbors above x 
= set of states accessible from x by adding one call 
B, = set of neighbors below x 


set of states accessible from x by removing one call. 


III. SUMMARY 


The basic probabilistic assumptions that define the randomness in 
the traffic models to be studied are given precise statement in Section 
IV. They are, briefly, (z) the hang-up rate per call in progress is unity, and 
(iz) the calling rate per idle inlet-outlet pair is a constant \ > 0. Vari- 
ous operational aspects, such as the disposition of lost calls, and the 
method of routing, are specified and discussed in Section V. It is as- 
sumed that lost calls are refused without a change in state, and that 
routes for calls are chosen in a way that depends both on the call being 
set up or processed and on the current state of the system. In Section 
VI these probabilistic and operational assumptions are summarized in 
a transition rate matrix, Q. In Section VII, a Markov stochastic process 
x, (the mathematical model for the operating system) is defined, and 
the statistical equilibrium condition Qp = 0 for the stationary proba- 
bility vector p of x; is formulated. 
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In Section VIII, the probability b of blocking is defined as the (prob- 
ability one) limit of a ratio of counter readings, and a formula for b 
is given in terms of the stationary vector p. From this formula it is 
shown, in Section IX, that a simple algebraic relationship often exists 
between the blocking probability b, the traffic parameter \, the mean 
load carried, and the variance of the load carried. 

The remainder of the paper is devoted to the study and calculation 
of the vector p of stationary probabilities. Two explicit solutions, one 
algebraic and one combinatory in character, are given in Section X. 
In Section XI it is shown that the combinatory solution is a special 
case of a general formula for the stationary measure of an ergodic 
Markov process. The dependence of p = p(X) on the network structure 
and the method of routing is analyzed in an elementary way in Section 
XII. It is first shown that p(-)/po(-) has components that are analytic 
in a neighborhood of the nonnegative real axis, and so are expressible 
in the form 


pel u + €) _ = m F 
aes au én (aft). 


Yor » = 0 and e = X sufficiently small, this gives an expansion of p 
in powers of \. It is then shown that with || the number of calls in 
progress in state x, pz is of order \'*! as \ > 0. This result renders 
possible a recursive calculation (Sections XII and XIV) of the coeffi- 
cients Cm(2,0) from the partial ordering S$ of S and a matrix used to 
specify the method of routing. Once p is developed as a power series in 
X, a similar expansion is readily given (Section XIII) for the probability 
b of blocking. 

In Section XV, finally, we completely solve the problem of calculating 
the coefficients cm(2,\) for arbitrary values of \ > 0, giving each such 
coefficient both a combinatory interpretation, and an explicit formula, 
viz., a sum of products along paths through S which are trajectories 
for x, permitted by the routing rule. 


IV. PROBABILITY 


To construct a Markov process for representing the random trajectory 
of the operating network through the set S of states, we shall make two 
simple probabilistic assumptions. The traffic models to be studied em- 
body what has come to be known as a “‘finite-source effect,” that is, 
a dependence of the instantaneous total calling-rate on the number of 
idle inlets, and on that of idle outlets. 

In an attempt to describe this dependence in a simple rational way, 
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let us imagine a customer located at one of the inlets [outlets] of the 
connecting network, and seek to assign him a ealling-rate, assuming 
that he is in an idle condition. We shall suppose that the traffic he offers 
is homogeneous in the sense that he calls every outlet [inlet] at the same 
rate, or with the same frequency. Indeed, we shall assume that all 
customers offer homogeneous traffic. Now on most occasions when he 
is making a call, a customer does not know whether the terminal he is 
calling is busy or idle. Thus, if he is on an inlet [outlet] it seems reason- 
able to suppose that there is a probability 


AA + o(h) A> 0 


that he attempts a call to a particular outlet [inlet] (distinct from his 
own) in the next interval of time of length h, as h — 0, whether that 
outlet [inlet] is busy or not. The qualifying phrase “distinct from his 
own” is inserted to cover the case in which some inlets are also outlets, 
and in which it is reasonable to suppose that an idle terminal that is 
both an inlet and an outlet does not attempt to call itself. 

We therefore make these two probabilistic assumptions: 

(a) Holding-times of calls are mutually independent random varia- 
bles, each with the negative exponential distribution of unit mean. 

(b) If at time ¢ the network is in a state 2 in which at least one member 
of the inlet-outlet pair (u,v) eZ X Q is idle (that is, one of u or v is not 
involved in a call in progress), the time elapsing from ¢ until a call 
between u and v is attempted is a random variable having a negative 
exponential distribution with a mean 1/\, \ > 0. For different choices 
(u,v) and different occasions ¢, these times are all mutually independent 
and also independent of the call holding times. 

These assumptions can be rendered in the informal terminology of 
“rates” as follows: 

(2) The hang-up rate per call in progress is unity. 

(iz) The calling-rate between an idle inlet wu (outlet v) and an arbi- 
trary outlet v (inlet wu) with u ¥ visd > 0. 

Assumptions (a) and (b) provide all the “randomness” needed to 
construct our models. The choice of a wnzt hang-up rate merely means 
that the mean holding-time is being used as the unit of time, so that 
only the one parameter \ need be specified. 


V. OPERATION 


To complete the description of the traffic models to be analyzed we 
must indicate how the network is operated. Since in the present work 
we are taking into account only the network configuration, and omitting 
consideration of the control unit, it suffices to describe how calls to busy 
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terminals are handled, how blocked calls are treated, and how routes 
or paths through the network are chosen. 

It will be assumed that attempted calls to busy terminals are rejected, 
and have no effect on the state of the system; similarly, blocked attempts 
to call an idle terminal are refused, with no change in the state of the 
system. All successful attempts to place a call are completed instantly, 
with some choice of route. 

To describe how routes are assigned to calls, we introduce a routing 
matrix R = (rey), with the following properties: for each zx let II, be 
the partition of A, induced by the equivalence relation of “having the 
same calls up,” or satisfying the same “assignment” (of inlets to out- 
lets); then for each Y ¢ Iz, rzy for y « Y is a probability distribution 
over Y; in all other cases rz, = 0. 

The interpretation of the routing matrix FR is this: any Y e II, repre- 
sents all the ways in which a particular call c not blocked in x (between 
an inlet idle in x and an outlet idle in x) could be completed when the 
network is in state x; for y « Y, rzy is the chance that if this call c¢ is 
attempted, it will be routed through the network so as to take the 
system to state y. That is, we assume that if ¢ is attempted in 2, then 
a state y is drawn at random from Y with probability rz, , independently 
each time c is attempted in 2; the state y so chosen indicates the route c 
is assigned. The distribution or probability {r.,, y « Y} thus indicates 
how the calling-rate \ due to the call c is to be spread over the possible 
ways of putting up the call c. It is apparent that 


>, xy = number of calls which can actually be put up in state x 
yeAg 


= is(a); (“‘sucecesses”’ in 2), 
the second equality defining s(-) on S. This account of the method of 
routing completes the description of the traffic models to be studied. 
VI. TRANSITION RATES 


For the purpose of defining a Markov stochastic process it is conveni- 
ent and customary to collect the probabilistic and operational assump- 


tions introduced above in a matrix Q = (qzy) of transition rates, here 
given by 
(1 YE B, 
Afzy Ye as 
diy = 


—|z|—As(v) yor 
i otherwise. 


MARKOV PROCESSES 2803 


The number q.,, for « # y, has the usual interpretation that if the 
system is in state x, there is a chance 


Qayh + o(h) 


that it will move to y in the next interval of time of length h, as h — 0. 
Similarly 


1 — deh + o(h) 
is the probability that the system will stay in x throughout the next 
interval of time of length h, ash — 0 
VII. MARKOV PROCESSES 


In terms of the transition rate matrix Q it is possible to define a 
stationary Markov stochastic process {w;, —%° < ¢ < +} taking 
values on the set S of states. The matrix P(t) of transition probabilities 


Pa(t) = Pr{x: = y| a = 2} 


of x; satisfies the equations of Kolmogorov 
d 
a Pf) > QP (t) = P(t)Q, 


P(O) = J, 
and is given formally by the formula 
P(t) = exp iQ. 


Theorem 1: There exists a decomposition of the set S of states into a 
transient set F and a single ergodic set S — F containing the zero state; 
members of F have the property 


lim pz,(t) = 0 yeF,xeS8; 
{> 
on S — F there is a unique stationary (or equilibrium) distribution 
{pz, c« ¢ S — F} such that 
lim pzy(t) = py > 0 yeS—F,xzeS 
to 


2 PPeult) = Dy yeS—F, allt 
xzeS—F 


Proof: The existence of the unique ergodic set S — F follows from 
the fact that the zero state is accessible from every other state by hang- 
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ups. The existence and character of the limit of p.,(¢t) ast —~ © isa 
consequence of exercise 19, p. 436 of Feller,’ ie., of the fact that the 
characteristic values r of Q satisfy r = 0 or Re(r) < 0. (See also Bell- 
man,’ p. 294.) 

To prove the uniqueness of p, suppose that q is a different probability 
vector on S — F that also satisfies the “equilibrium” condition 


Do date = 0 yeS—F., 
xreS—F 


Then by Kolmogorov’s equation 


d 
Ti oD zDry(t) = e QxQrzeDey(t) = 0. 
dt xweS—F x,2eS—F 


Integrating from 0 to ¢, and using P(0) = J, we find 
Dy QePry(t) = dy ye S —F. 


zeS—F 
Since S — F is the only ergodic set, the left-hand side approaches p, as 
ti— o, Hence p = q. 

It is convenient to extend the dimension of p to | S| by adding zero 
components for states in F’, so that p.,(t) — py, 2 O for all ay ¢ S. 
The consideration of the transient set F is not just a mathematical fillip, 
since a “good” routing rule R may explicitly make certain “bad” 
states unreachable from the zero state, and thus place them in J’ to good 
purpose. 

In the notation of Halmos,° p. 65, the stationary probability vector 
satisfies the equilibrium condition 


Qp = 0. 


This is the classical equation of state, or equation of statistical equilib- 
rium, familiar in traffic theory. For our process 2; it takes on the rather 
simple form 
[|e] + 3()|pe = Do Py + 22 Pater, ES. 
yeag yYeby 
The left-hand side represents the average rate of exits from x, while 


the right-hand side is the average rate of entrances into x, in equilibrium. 
We define 


pee 2, Pe = Peli}. 


|zj=k 


Lemma 1: For1 Sk S w = max |x| 


reS 


tee 82). 


weLEe_1 


kpr 
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Proof: From the statistical equilibrium equation for x = 0 we obtain 


D5 Py = As(0) po ) 
yeAg 
which is Lemma 1 for k = 1. Assume that the lemma holds for a given 
k = 1. Summing the statistical equilibrium equations over x « Li we 
find 
kp, + 2d 8(@) D2 = 2d 2, Py +X »; b> Di yz + 
reLy TELE YeAg 


weLy yeBy 
The second sum on the right is the same as 


r Ds Dy. Do Tey 


yeLR_y zreAy 


and by definition, 
ay Tyz = S(y). 
reAy 


Hence (induction hypothesis) the second sum equals kp; . It is easy to 
see that in the first sum on the right each p, is counted exactly | y | times, 
i.e., (k + 1) times, since for a given y ¢ Zx4, there are exactly (k + 1) 
elements x ¢ L, for which y ¢ Az. Thus the first sum is 
(h+1) 2) py = (h+ Vp es 
yeLyyy 

and the Lemma follows by induction. This result could also be obtained 
from the general observation that the statistical equilibrium equations 
are equivalent to the principle that for any set X of states the average 
rate of exits from X equals the average rate of entrances into X. (See 
Morris and Wolman.’) 


VIII. PROBABILITY OF BLOCKING 


The fraction of calls that are refused because they are blocked, or the 
probability of blocking, is a quantity of particular interest to traffic 
engineers; they use it to assess the grade of service provided by an operat- 
ing connecting network. The rigorous theoretical calculation of blocking 
probabilities has long been an outstanding problem of traffic theory. 
This problem is outstanding in both senses of the word: it is conspicuous, 
and it is unsolved. In fact, not even the definition (let alone the calcula- 
tion) of the probability of blocking has received adequate treatment; 
for example, the otherwise monumental treatise of R. Syski® does not 
give a general account of blocking probability. 

Since it is desirable to have a close connection between theoretical 
quantities and their physical meanings in terms of measurements, we 
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shall approach the study of blocking probabilities by asking how these 
probabilities might be measured “‘in the field.”” The most natural method 
of measuring the fraction of blocked attempts seems to be this: to the 
control unit of the connecting system under consideration we attach 
two counters; the first will count up one unit every time an attempted 
call is blocked, and the second will register one unit every time a call 
is attempted; the ratio of the reading of the first counter to that of the 
second should, after a long time during which the system’s parameters 
remain constant, be an approximate measure of the fraction of blocked 
attempts. For mathematical convenience, one can then define the prob- 
ability of blocking to be the limit (as time increases without end) of this 
ratio of the counter readings. This mathematical definition was first 
proposed by 8S. P. Lloyd, although, of course, the ratio has been the 
practical definition for 50 years, being the ‘‘peg count and overflow 
ratio.” 

A precise mathematical version of this measurement procedure can be 
given as follows: on the same sample space as that of the process x; 
that describes the operating network, we define two additional stochastic 
processes {b(t), 4 2 O} and {a(é), t = 0} by the (respective) conditions 


b(t) 
a(t) = number of attempted calls in (0,¢]. 


I 


number of blocked attempted calls in (0,¢], 


These stochastic processes are the mathematical analogs of the counter 
readings. It is reasonable to use the limit 


b(t) 


tow a(t) 


of the ratio of b(-) to a(-) as a mathematical definition of the prob- 
ability of blocking, provided that the limit exists in a suitable sense. 
We show that this limit exists and is constant with probability one, 
and we give a formula for it. 


Theorem 2: The probability of blocking b, defined by 


_ 4. b(t) 
Oe aye 


exists and 1s constant with probability one; its almost sure value is 
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where p is the vector of stationary probabilities, and 
Bz = number of idle inlet-outlet pairs that are blocked in state x, 
a, = number of idle inlet-outlet pairs in state x. 


Proof: It can be seen that a(t) and b(t) can be written as sums over 


S, 


a(t) = Zs a,(t) 
b(t) = D7 bald) 


res 


where 


a,(t) = number of attempted calls made in (0,¢] with the system 
in state x, 


b.(t) = number of blocked attempts made in (0,t] with the system 
in state x. 


Now a blocked attempt occuring at an epoch wu such that x, = x does 
not change the state of the system. Such an epoch wu is a regeneration 
point of the process «,;. A successful attempt occurring at an epoch u 
at which x, = x does change the state of the system. The time interval 
from u back to the last previous epoch v at which a successful attempt 
occurred in state x, however, is independent of the behavior of x, for 
t > u; it depends only on the fact that the system left « by adding a 
new call, not on what new call it was, nor on where into A; x; went as 
this new call was completed. This can be seen as follows: we have 


aS SUP Se 


where 7 is the epoch at which x was last entered prior to u. Now 7 — v 
is independent of x, for ¢ > 7 if x49 is known to be x, because x; is a 
Markov process. 

Let U be an event measurable on {x,, ¢ > u}. Then 


Pr {U andu — 7 S p| tno = 2} 





= Pr{u—+r Sultro= 2} Dd ner Pr {U | tuo = y};, 
YEeAy 
where 
= | a | - As(a) 
_— < — a] ——— eee 
Bulan | tego ah (rr + rs(2) A + As(x) 


ae Zr dt. 
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Thus the time intervals 6, , 62, --- elapsing between successive blocked 
attempts in state v, and those q,, a2, --: elapsing between successive 
attempts in state 2, both form sequences of mutually independent, and 
except possibly for the first elements 6; and a, , identically distributed 
random variables. That is, the elements of each sequence are mutually 
independent, but the sequences are not independent since one consists 
of partial sums over blocks of the other. 

Both these sequences can be studied, then, in terms of a sequence 


1, 2, °+: of mutually independent random variables, all (except 
possibly 2) identically distributed. We define for = O and k 2 0 

So = 0 

Sn = Vi, C= @ or By 


7=1 


I 


n(t) =k ifandonlyif S, St < Sra, 
n(t) = az(t) or b-(t). 


It is now straightforward to show that {'n(t) approaches a limit with 
probability one, and to find the limit. Let us put, for? > S, 


nt) _ n(t) Suc 
t Sig t 


The first factor converges to {x2} with probability one, by the law of 
large numbers. The local suprema of 


b = Sra 
t 
for ¢ > S, occur at the points 
= & k=1,2,--:, 
and have the values 
t-37 k=1,2,--- 


Again, the first factor converges to E~'{x2} with probability one by the 
law of large numbers. Since E{ x2} < ©, and {a , k = 2} are identically 
distributed, 


> Prix, > ek} = >> Pr{ae > ch} < @ 
k=2 k=2 
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and it follows from the Borel-Cantelli lemma that for any « > 0 
Pr{a, > ek for infinitely many values of k = 2} = 0. 
Hence a, = o(k) as k — , with probability one, and with the same 


probability, 


—>1as t-.2. 


Srv 
a 


It follows that with probability one, 
lim {'a,(t) = E"{time interval between successive attempted calls 
t>% 


in 2}, 


lim ¢-*b.(t) 


t>0o 


E'{time interval between successive blocked at- 
tempted calls in 2}. 
Furthermore (cf. Ref. 11, p. 247, equation (1.2) and p. 249) 
© B (b(t) [vo = y} = Apyz(t)Bz 9 APrBr aS t— ow. 
However, by T'eller’s renewal theorem (cf. Ref. 11, p. 246), we know 
that 
lim t'E{b,(t)| 2 = y} = E”'{time interval between successive 
= blocked attempted calls in 2}. 
Hence, with probability one, 
tb.(t) > \pzBz as to. 
A similar argument shows that with probability one 
ta:(t) > \prar aS t— 


and completes the proof of Theorem 2. 


IX. A BASIC FORMULA 


Engineers have recognized (at least) four quantities as significant 
for the study and design of connecting networks carrying random traffic. 
These are the calling rate, the average load carried, the variance of the 
load carried, and the probability of blocking. In our model these quanti- 
ties are given respectively by 


2810 THE BELL SYSTEM TECHNICAL JOURNAL, NOVEMBER 1963 


\ = calling rate per idle inlet-outlet pair 
m = >) |x| ps2 = average number of calls in progress 
zeS 


o 2d (12 — m)’p; 
b 


(p,8)/(p,e). 


It is natural to ask whether there exist any systematic relationships 
between these quantities, or between these and (possibly) other simple 
parameters of the network under study. Such relationships would be 
particularly useful and significant if they were largely independent of 
the structure or configuration of the connecting network, and were valid 
either for all networks or for large classes of them. We shall show that 
there often exists a simple algebraic relation among \, m, o’, and b. 
Its exact form depends on which inlets are also outlets. l’irst we prove 

Theorem 3: The probability b of blocking can be written as 


at. (1) 
NDE pron 


zeS 
or, in words, as 


a ae average load carried 
(calling rate per tdle) X (average number of idle pairs) ~ 


Proof: In equilibrium, the average rate of successful attempts must 
equal the average rate of hangups. Hence, intuitively, 


» 2, pis(«) = > | x \p2 = mM. (2) 


Since 6. = az — s(x), the result follows from Theorem 1. The actual 
validity of the identity (2) can be inferred from Lemma 1, by summation 
on k. 

Formula (1), rewritten in the form 


average load carried 


fe ee 
average rate of attempts’ 


should be viewed as a direct generalization of Erlang’s classical loss 
formula for c trunks, blocked calls cleared, and calls arising in a Poisson 
process of intensity a > 0. In that case the probability of loss is 


c 
a 


E,(c,a) = ct 


c j? 





Q 


nw 
ll 
° 
i) 
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and it can be seen that 


6 
| 
e 
1 
w, 


wR 
I 
° 
Se 
pases 


1 — Eile) = 





we, d 
° iMe 
S18 
Q we, 
we, 


g 
a or 
j=0 J! 
average number of busy trunks 
total calling rate 


ray 
No, 


a ee 


we, 





Q 


To exhibit useful special cases of the general formula (1) of Theorem 
3, we introduce a partial classification of connecting networks. A net- 
work is called one-sided if J = Q, i.e., if all inlets are also outlets; a net- 
work is two-sided if 11 Q = 49, ie., if no inlet is an outlet. 

Corollary 1: For a one-sided network of T terminals 


e 2m 
\ (T — 2m)? — (T — 2m) + 40?” 


Proof: For the one-sided network in question, we have J = Q, |I| = 
|Q| = T, and so 


= 
a, = 
2 


DS pote = 2{T? — (2T — 1)2m — T + 4m? 4 40%. 


zeS 


b=1 


Corollary 2: For a two-sided network with M terminals on one side and 
N on the other 


1 m 


~ (f= m) (N = m) Fo?’ 
Proof: It is clear that in this case 


az = (M — |x|)(N — |x]) 


b=1 


so that 
D> Dide = (M — m)(N — m) + 0°. 
zeS 


Each of the foregoing corollaries exhibits an explicit algebraic rela- 
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tionship between \, m, o, and b, based only on the one- or two-sidedness 
of the network. 

The preceding corollaries can be used to show that in a large system, 
the numerical value of the constant » will be small—indeed, of the order 
of the reciprocal of the number of inlets and outlets. This can be seen 
by the following heuristic argument, carried out for a one-sided network 
with 7 terminals: suppose that each terminal carries gq (0 S gq < 1) 
erlangs and that the blocking probability b is so small that we can ignore 
it and set 


_i1 2m _ 9 
\(T — 2m)? — (T — 2m) + 402 


Since the network is one-sided, any load carried by one terminal is 
also carried by some other terminal, and so 


b=1 





ql = 2m 
whenee 
Reavis oe 
£ (1-4 q) , 40°. 
gy Se Se ae 
( q) T 3 yr 
Because 
a= 2, pal | = m)° 
and 
1 
2 
= ee 
(je|—m) sz, 
we have 0 < 40°/7” S 1, and so 
const. 
AS 7 


with 





<= const. < ( 


q q . 
1 gee Cee) Lg 9) 


X. SOLUTION OF THE EQUATIONS OF STATISTICAL EQUILIBRIUM 


So far, we have shown that the theoretical determination of the 
blocking probability b reduces to that of the stationary vector p or, 
in many cases, to that of the mean m and variance o° of the carried 
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load. In either case, some knowledge of p is required. Most of the rest 
of this paper, therefore, is devoted to the calculation of p and to the 
study of its properties. 

In the past, the application of A. K. Erlang’s very natural method 
of “statistical equilibrium” to congestion in connecting networks has 
been visited by the curse of dimensionality, that is, by the extremely 
large number | S | of equations comprised in the stationarity condition 
Qp = 0. This difficulty has not only put explicit solutions apparently 
out of the question; it has even made it effectively impossible to reach a 
reliable qualitative idea of the dependence of the state probabilities 
{pz , x € S} on the structure of the network and on the method of routing. 

To be sure, it has always been possible in principle to solve Qp = 0 
by successive elimination of unknowns; however, when the dimension 
of p is of order 10” or so, this remark is hardly helpful. Since successive 
elimination can be used to solve Qp = 0 for any “ergodic” transition 
rate matrix Q, it neither elucidates nor uses any of the special features 
of the matrices Q that arise in problems of congestion in networks. 
Thus, even were it algebraically feasible, the method of successive 
elimination treats our matrices Q as indistinguishable from other matrices 
possessing a zero characteristic value. 

We shall give several explicit solutions of the equilibrium equations. 
One is based on purely algebraic considerations, and the others largely 
on combinatory and probabilistic notions. Because of the generality 
of our model with respect to network structure, the formulas appearing 
in the solutions are necessarily rather complex. Except in simple cases, 
they cannot be regarded as giving a final (or even a working) solution 
to the problem of calculating equilibrium probabilities. Nevertheless, 
they expose the mathematical structure of the problem and provide a 
badly needed starting point for well grounded approximations. lor only 
after one has studied and understood this structure can he seriously 
think about throwing some of it away in approximations. 

To describe the solutions in full detail, we need various preliminary 
definitions and conventions. 

It will be shown in a later paper’ that the minimum value of the 
blocking probability b is achieved by a routing matrix R consisting 
entirely of zeros and ones, i.e., by a deterministic rule. So it is assumed 
henceforth that R& has only zeros or ones for entries. 

A path on S of length 1 = 0 is any ordered sequence 2, %1, -°- , 7 
of (f + 1) elements of S. A lower ease pi, 7, will be used as a symbol 
for a generic path on S, and we write 


w = {%,%1, °°, tt, L= I(r) 
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to indicate that 7 is a path of length [(7) consisting of %, %, °+* , 1 
in that order. Note that paths of length zero are countenanced. A path 
x is a loop if 


% = V1, 
and also 
Ly FU; 


whenever 0 <i <7 S I(x). A loop of length zero is a path of length 
zero. If w = {%, 21, °** , Xo} is a loop, each element 2, 2%, --- ete. 
will be spoken of as being on z. 

The elements x and y of S are called adjacent in the graph of (S,S), 
i.e., in the state diagram, if one of the following equivalent conditions 
holds: 


(7) x covers y or y Covers 
(it) ye Arorve Ay 
(iit) x and y differ by exactly one call in progress. 


A path on S is called continuous if successive elements of the path are 
adjacent. 

In order that x; have positive probability of following a path 7, it is 
not enough that 7 be continuous. For evidently the action of the routing 
matrix R (assumed to consist solely of zeros and ones) is to prohibit 
certain paths on S as (parts of) possible realizations of the process 
x,. Here ‘possible’ of course means “having positive probability.” 
There exists then a class of those paths that are permitted by R, definable 
in several ways. One such way is as follows: A path w = {2 ,%1, °°: x} 
on S is permitted by R if for each 7 in the range 1 $7 S 1, 


t¢é Ba. 4 or Tr;32, = 1. 


The set of paths permitted by R is denoted by P. 
With X a subset of S, 


perm(X ) 


will denote the set of all permutations of X, 1.e., one-to-one maps of X 
onto itself. We let 


Yr, Y2,°°* Ys} 


be an arbitrary simple ordering of S, and we define the ordinal number 
w(x) of a state x e S by the condition 


a(c) =n ifandonlyif += y,, n=1,2,---,|S|. 
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For each m,n in the region 1 S$ m,n S | S |, we define a function ¢mn(-) 
on the domain 1 S$ 7 S | S| by the condition 


(i 1 <i < min(m,n) 
1=m or t=" 
Sma(t) = . ; : 
a—1 min(m,n) <2t< max(m,n) 
i—2 max(mn) <is | S|. 
We observe that €mn(-+) has an inverse for each m and n. Now let ¢(-) 


be a permutation of the set of states with the mth and the nth removed; 
then 


i Emnlole(Ytmn ci) ]} i=1,2,::-,|8|-2 


defines the permutation Gn.(g) associated with g. Also, sgn Amn(y) is 
+1 or —1 according as the permutation @mn(g) is even or odd. 
The “hang-up” matrix H = (h,,) is defined by the condition 


1 if ye B, 
Reg = 
0 otherwise. 
Let x and z be states, and suppose that t = {x, --- , 7} is a path 


in P beginning at z and ending at x, so that 7% = zand 2; = «x. Suppose 
also that the trajectory represented by z contains m new calls, i.e., 
there are exactly m values of 7 in the range 1 S 7 S / such that 


Uv; € 5 Cone . 


Since z starts at 2, in which | z| calls are in progress, and ends up at 2, 
in which there are | 2 | calls in progress, it is evident that 


I(r) = 2m + \z2| —|a|. 


The set of paths which start at a state z, never return to 2, and end 
up at « ~ z, is denoted by 


eax 


Thus 7 belongs to K,. if and only if a = 2,7; # zfor0 <i U(m), 
and x; = &. 


Let + = {2 ,%1, +++, 21} bea path on S, and let f(z,-:) be a function 
defined for x , 21, ++: ,%. In terms of f(z,:) and z, we define a product 
along the path z by the expression 

1(7) 
I] f(r, x3). 


W={2q,21,+++, 1} 
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It is convenient to abbreviate this product by the readily understand- 
able expression j 


I /(r,). 


In the special case that f(7,-) has the form 
(GSI a) PEG eT 
we abbreviate the product by 
JI h(pv,z). 


The notation p- is supposed to represent the predecessor of an element 
in 7. 

The first and simplest solution of the equilibrium equation Qp = 0 
to be given is based on an observation made by I. W. Sandberg, namely, 
that det(Q), and hence det(Q’), are zero, so that Q’adj(Q’) = 0, and 
thus columns of the matrix of cofactors of Q should give solutions of 
Qp = 0. The author has not succeeded in elucidating the probabilistic 
significance of these simple algebraic facts. It will be seen later that the 
other solutions to be given are, on the other hand, natural, plausible, 
or even obvious from a probabilistic viewpoint, but are algebraically 
involved. 

Theorem 4: Let m be an integer in the range 1 S m S | S|. An unnor- 
malized nonnegative solution p of Qp = 0 ts given by 
= 1) |S] —1+m-+n 


SEN Amn(y) 
ye perm (S—{ymn}) 


LC |e] As(z)) TL eee + APz(2))- 


gy (Zz) =2 
Proof: Since det(Q) = 0, it follows that det(Q’) = 0, the prime indi- 


cating the transposed matrix. Hence (Birkhoff and MacLane,” p. 290) 
no matter what ordering of S is used. 


Q’adj(Q’) = 0, 
where ‘adj’ denotes the adjoint matrix, i.e., the transposed matrix of 
cofactors. Let C = (czy) be the matrix of cofactors of Q corresponding 


to the ordering y; , Ye, °°: , yj s; of S, and suppose that the entries of Q 
are also arranged according to this ordering. Then 


C = adj(Q’) 


Pun = ( 


and we find that 
= Gexlzy = O. 
zeS 
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Thus any column of the matrix C of cofactors of Q gives a solution of 
the equilibrium equations. It follows from a result of W. Ledermann 
(Bellman,’ p. 294, exercise 10) that 


ean: e Ve ee = 0. 


We see that all the cofactors c,, have the same sign, and each column of 
the matrix C yields a nonnegative solution of Qp = 0. Hence all columns 
are proportional, because there is only one nonnegative solution, up to 
normalization. The theorem follows from the standard formula for a 
cofactor as a determinant. 

Theorem 5: If z « S ts any state, then an unnormalized solution of the 
statistical equilibrium equations Qp = 0 is given by 


pz = 1 
and for x # 2, 
> L(r)-+|2|—|21 1 
2 = » 2 a, 
ee eine ver |y | + As(y) 


Proof: The formula given can be verified by direct substitution in the 
equations 
[As(z) + | 2 [lpe = De Pu td Do Dive ve S. 
yeaAy Yee 


‘Convergence of the infinite sum will follow from our Theorem 7 and 
exercise 19, p. 378 of Feller.‘ 


XI. STATIONARY PROBABILITY MEASURES FOR ERGODIC MARKOV PROCESSES 


In order to shed light on Theorem 5 (and also to prove it by a prob- 
abilistic argument) we shall consider in this section the general problem 
of calculating the stationary probability measure of an ergodic contin- 
uous parameter Markov process on a finite number of states. Our object 
is to give an explicit formula for the measure in terms of the transition 
rate matrix. Again, it is needless to mention that a formula of such 
generality must be fairly complex. Applied to familiar Markov processes 
whose stationary measures are well known, the formula to be given 
yields some unexpected combinatory identities, not pursued here. 

We shall now use the notations 2,, S, Q, and P(-) to describe an 
arbitrary Markov stochastic process 2; in continuous time, taking values 
in a finite set S of states with transition rate matrix Q = (qz,) and tran- 
sition probability matrices P(t) = (pz,(t)), t real. It is assumed that 
there is a single ergodic class of states. Such a general interpretation of 
notations already introduced (for specific processes describing traffic in 
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connecting networks) is made to avoid defining new terminology; it is 
made in this section only, and should cause no confusion. 

If zis a state, a return to z is defined to be an epoch of time at which 
x, reaches 2, 1.e., wu is a return to z if for some « > 0, x; ¥ z for u — e€ 
<t<uanda2, = zforu<t<u+te. A departure from z is an epoch 
of time at which 2; leaves z, i.e., wis a departure from z if for some e 
> 0,a%, = zforu—e<t < wanda, ¥zforu<t<cuteA 
return time to z is a period of time elapsing between a departure from 
z and the next return to z. We set, for ¢ = 0, 


H,(t) = E{number of returns to z in (0,] | vo = 2} 
uz= E{return time to 2} 
Ge= —dz = E™{length of a stay in 2} 


The notation H,(-) has been chosen because the defined quantity has 
an obvious resemblance to the classical renewal function. (See Smith.”) 
There is a simple relationship between the equilibrium probability of 
a state x, and the quantities u, and q,; this is expressed in the next 
theorem which, though probably familiar, is included for completeness. 
Theorem 6: For x € S, pz = [1 + qruz]- 
Proof: The transition probability pz:(¢) approaches pz as t > ©, 
and is expressible as 


t 
Drx(t) =e =F | ao dH,(w). 
0 


Since stays in x and returns to 2 are all mutually independent, the stays 
being identically distributed, and the returns also, the renewal theorem 
[Smith,” p. 247, formula (1.3)] implies that the right side approaches 


© tet dt 
i 1 


zi interval between successive ~ 1 qua 
returns to x : 


Thus p, can be calculated from uz where 


t 
Me = i u dPr{return time to « S u} 
0 


= i? Pr{return time to x > u} du. 
0 


For our purposes it is convenient to approach the calculation of p; 
in a slightly different way. Let z be any state, and let x be a state dis- 
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tinct from z, 2 ¥ x. Define q..(¢) in t = 0 to be the probability that if 
the stochastic process start at 2 at time zero, it be at x at time ¢ without 
having returned to z. Thus 


Qex(t) = Pr{x,; = «x and (epoch of first return to z) > t| xo = 2}. 


For convenience, we set q.:(t) = Oint = 0. 
Lemma 2: For z # x,t = 0, 


pt) = | ai SuanCl). 


Proof: Let t;, 7 = 1, 2, ....be the epoch of the ith return to z in 
t > 0, and let A;(t) be the event 


{vs = 2@ and t; < t< tesa}. 


Then 
t 
Pr{A,(t) | x0 = z} = / den (t = u) dPr {t; <u | mM = By 
0 
However (cf. Ref. 11, p. 251, formula (1.7)), 


H(t) = Pr{t; S t| x = 2 


| 
Ms 


i] 
Be 


v 


and 


Ms 


Per(t) = D, Pr{Ac(t) | ao = 2}. 


v 


I 
Pm 


The integration and the summation can be interchanged by the mono- 
tone convergence theorem, and the lemma follows. 
Lemma 3: For z # x, 


oe OE ie ee(u) du. 
Dee Tag. 


Proof: The integral on the right exists, since 
/ Qez(u) du = E{time spent in x between successive returns to z} S uz. 
0 


The lemma follows from Lemma 2 and the renewal theorem. 
The matrix A is defined by the condition A = (az,) with 


dey wry 
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It can be verified that A is a stochastic matrix, indeed, the one-step 
transition probability matrix of a Markov stochastic process {zn , n an 
integer} taking values on S; 2, is a discrete-time analog of x; obtained 
by ignoring the lengths of time spent in a state. 

Lemma 4: For z ¥ x, 


1 number of arrivals at x between 
Qer( Ut) du=—EH : . 
0 successive returns to 2 


Proof: The integral is the expected time spent in x between successive 
returns to z. Each stay in x has mean length 1/q;, and the stays are 
independent of the rest of the trajectory followed. 

Lemma 6: For z # x, 


R number of arrivals at 2 between| 
successive returns to z 


= >) Prix, = x and 2; #z for 1 Sj S nia = 2. 
n=1 


Proof: We remark that the expectation on the left is the same for 
both x; and a, . The lemma is then a special case of the theorem that if 
{A;,7 = 1, 2,...} are any events, then the expected number of A; 
that occur is 


> Pr{ Aj. 
Lemma 6: 
we =f gala) du 
Proof: This is an immediate consequence of q-.(-) = 0 and 


| Qex(u) = H{time spent in x between successive returns to 2}. 
0 
for z ¥ x. 
Lemma 7: Let x # z. Then 
Pr{g, =x and 2; #2 for 1873 | = 2} 
a Ge Youn 
Qz TeKzz yew Qy 
l(r)=n 


Proof: The event in question can occur in as many ways as there are 
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paths of length n in K,,. The probability that {z;,0 S$ 7 S n} followa 
path w from z to x is 


de 7] Geuy 


’ 
Qzyer Qy 


the quotient in front correcting for the end points. 

Combining Theorem 6 with Lemmas 4, 5, 6, and 7, we obtain the 
following explicit formula for the stationary probabilities: 

Theorem 7: If x # 2, then 


Dz = 1 py.u 
lL + Qelz ern vem Yy 
with 
Qzbz = » ps II aes ) 
wAx2 TER zy YET il @y 
and 
Pee ae 
° 1 + QzMz ° 


We remark that Theorem 5 follows from the above if we choose z = 0 = 
zero state, omit normalization, and observe that only products along 
permitted paths (7 « P) are nonzero. Theorem 7 is an analog for con- 
tinuous parameter processes of a theorem of Derman” for Markov 
chains. 


XII. EXPANSION OF THE STATIONARY VECTOR PD IN POWERS OF d 


We now turn to examining, in an elementary way, the analytical 
dependence of the state probabilities {pz , 2 « S} on the calling rate X, 
on the structure of the network, and on the routing matrix R. It will 
be shown that the partial ordering < of the set S of states can be used 
to calculate the elements of p by expanding the ratios 


Ps x>0 

Po 
in powers of the traffic parameter \ in a neighborhood of \ = 0, and then 
determining the coefficients of this expansion from the structure of the 
network and the routing matrix by a recursive procedure. The solution 
so obtained is later (Section XV) extended to arbitrary real positive 
values of \ by analytic continuation, and the coefficients are calculated. 
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Our approach to studying the stationary probability vector p will be 
guided by these intuitive remarks: it is known that in various simple 
models (of connecting systems carrying random traffic with blocked 
calls refused) the probability that k calls be in existence is proportional 
to the kth power of a constant associated with the calling rate divided 
by k factorial. For example, in Erlang’s model for c trunks with blocked 
calls cleared, the chance that k calls are in progress is proportional to 


k 


a 
kl? 0O<k 


IA 


Cc 


where a is the calling rate. Note that the exponent of a is the number of 
calls in progress, i.e., the current difference between the cumulative 
number of new calls and that of hang-ups, assuming that the system 
started in the zero state. The factorial in the denominator is the number 
of orders in which the & calls in progress could all hang up, or alterna- 
tively, could all have arisen. 

The situation in our model is very similar. Each call still in progress 
required an event occurring at the rate \ to put it in existence; for each 
state x, there are exactly ||! orders in which the || calls in prog- 
ress in x could arise, or terminate. These circumstances suggest that for 
« > 0, pz might be of order \!*'!as \—> 0, and that the coefficient of \'*! 
in p, might involve |x|! in the denominator. These conjectures are 
true, and are the first step in the systematic calculation of pz by ex- 
pansion in powers of X, to be carried out in this section. 

We first record some analytical properties of p as preliminary results. 
Some of these results could be obtained as consequences of the basic 
solutions given in Section X. Most of the proofs to be given, however, 
are independent of Section X, and proceed by simple arguments from 
the equilibrium equation. 

When we need to view pz as a function of the parameter \, we write 
D2 = p2(d), x € S, or in vector form, p = p(A). 

Lemma 8: tim pi(\) = bz. 


Proof: Let x be a maximal state in the partial ordering < of the set 
S of all states. Then s(x) = 0, and 


| x | p2(A) = rd, Dy(A)Tyz « 
Yee 
Since 0 S p,(A) S 1 for all \ > O and all y e S, the lemma is true for 


maximal states. Assume, as a hypothesis of induction, that the lemma 
is true for all y with |y| 2 &+ 1. Then forre L,,k > 0, 


[|x| + As(x)Ip.(A) = 2 Dy(A) +2 2d Py(A)Tye 5 
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and so pz(\) — 0 as \ — 0. The proof is completed by observing that 
for each A > 0, 


pod) = 1 ap p:(d). 


Lemma. 9: For each x ¢ S, pz ts the restriction to real positive argument 
of a rational function pz(-) of a complex variable wp. The function pz(-) 
has no poles in a neighborhood of the half-line Re(u) = 0, Im(u) = 0, 
and an expansion 


pilute) am 

Dol 6) ~ Sy ome) 
with real coefficients Cm(z,\) is valid for Re(u) = 0, Im(u) = 0, and 
| e | small enough. 

Proof: The equation Qp = 0 can be solved for a normalized (i.e., 
probability ) vector p(\) by successive elimination or by use of Theorem 
4, Kither procedure gives rise to an algebraic expression for p.(d), 
x ¢e S. Let p.(u) be that rational function of a complex variable yu de- 
fined by substituting » for \ in this algebraic expression. Since 0 S 
px(\) S 1, pz(u) has no poles in a neighborhood of the nonnegative 
real axis. To justify the expansion we show that p.(-)/po(-) is also 
analytic in that neighborhood. But this is immediate because by Lemma 
8, 


pi(r) — do2 as AO, | 


and by Theorem 1, po(A) > 0 for \ = O because the zero state belongs 
to the ergodic class S — F. 

Setting » = 0 and e = 2d in Lemma 9, we obtain an expansion of 
p2/po in powers of the traffic parameter , 


pz(A) _ - m r 
po(r) = Py Nem (#,0); 


valid for \ small enough. 
Theorem 8: Fork = Oand a ¢ Ii, 


= Pe = O(X*) as 4 —> 0, 
xceLy 





and 
pr = O(K) as A> 0. 
Proof: We prove both results simultaneously by induction. By Lem- 


ma 8, the result is true for k = 0. Assume that it is true up through 
k —1 = 0. From Lemma 1 and the induction hypothesis, we find 
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fk 
Pi = i » Dys(y) 
vV yeLR_y 
=.O0(*) as \— 0. 


By Lemma 9, for \ small enough p, can be written as a power series 
around \ = 0 


De = DX Cala,0): 
m=0 
Thus 


SA" DS em(x,0) = pe = O11) as XO. 
m=0 zeLy 

The first nonvanishing coefficient in the expansion on the left must be 
positive, else p, < 0 for \ > O small enough, which is impossible since 
p(X) 2 OfordrA > 0. Hence 

ye €o(x,0) 2 

However, the first nonvanishing coefficient in the expansion of p, must 
also be positive, for the same reason as above, namely, that p.(A) = 0 
ford = 0. Thuseo(x,0) 2 0. Hence p, = O(x*) as\ > 0 implies ¢9(2,0) = 
0. We apply the same argument successively to show that for a ¢« L;, 





the coefficients ci(7,0), ..., cx1(2,0) are all zero, and the theorem is 
proven. 
Theorem 9: For x > O 
rd x 
pe = metre + of") as A—- 0 
where 
%, = CR Na: 


the 0,x entry of the |x| th power of the routing matrix R 
the number of permitted strictly ascending paths from 0 to x. 


Proof: The equation of statistical equilibrium that defines p is 


[| 2] + As(x)]p = 2a Pu td Le Pobve wes. 
yeBe 
For convenience, suppose that |x| = k. We divide the equation by 


Dy, use Lemma 9 to expand the components of p/p in powers of A, and 
equate the coefficients of \“ on each side of the equation. This gives 


kex(x,0) + s(t)era(z,0) = 2, cx(y, 0) + 2, Ce-a(Yy,0)T yx - 
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By Theorem 8, cx-1(v,0) = 0 and c.(y,0) = 0 for y « Az. Therefore 
ken(2,0) = 2) c1(y,0) rye ; 
yeBy 


or in general, with y the vector with components ¢),;(y,0), 


] 
C\2\(2,0) = tela Cy (y,0) ry 


1 

= iz] Cy (Y,0) Pye 
1 

= [a | (Ry): 


Iterating this relation |x | times, we find 


1 


C\x\(2,0) = [x]! Ge Yes 
Now it is easily seen that the y,x entry of R’ is zero unless k = |x|] — 
|y |, and in particular, if k = |x |, this entry is zero unless y = 0. Thus 
1 2 
Cz} (7,0) =; aT (R! ') ox€o(0,0), 


jx]! 


and it is obvious from the definition of the c,.(y,0) that co(0,0) = 1. 
Theorem 10: Let the sequences {cn(x,0), m = 0, x € S} be defined re- 
curstvely by 
Cm(0,0) = 6 m0 
Cx(z,0) = 0 for OSm<|z] 


and x«>0 


Te 
Ja]! 


| x | em(x,0) + s(x)ema(@,0) = 2) em(y,0) + D> Cm1(y,0) bye 


yeAg YEeBy 





Cj21 (2,0) => 


for m>|x| and «> 0. 


If forx > 0 
\ < (lim sup | em(a,0) [/”") 7! 


then the component pz of p 1s gwen by 


De = Do D2 d"Cm( 2,0). 
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Tf 
\ < min (lim sup | ¢m(2,0) |"), 
x>0 m>o 


then the probability po of the zero state 1s determined by the normalization 
condition >> pr = 1as 
zeS 


an 
1+ >> DS Aen (2,0) 
z>0 m=0 


Proof: This result follows immediately from again 9 and Theorem 9, 
using the standard formula for the radius of convergence of a power 
series. 


XIII. EXPANSION OF THE PROBABILITY OF BLOCKING IN POWERS OF \ 


With a method of calculating equilibrium state probabilities for small 
\ at hand (in principle, at least.) we now show how the probability b of 
blocking can be calculated, to any desired degree of accuracy, by an 
expansion in powers of the traffic parameter \, assumed sufficiently 
small. In most connecting networks of practical interest, none of the 
states near the bottom of the state-diagram has any blocked calls, so 
that it is necessary for a state x to have certain minimum number of 
calls in progress before it can have any blocked idle pairs. To take ad- 
vantage of this situation in our calculation, we let 


= least k such that some call is blocked in a state of Ly, . 


Theorem 11: The probability b of blocking can be expanded in a power 
series in X in a neighborhood of = 0; only terms of order higher than or 
equal to \” appear. 

Proof: From Theorem 2 we have, since 6: = 0 for |x| < n, and 
cn(@,A) = Ofork < |a|, 


a P2Bx 


bo Lele! 


2 PzrOx 
(3) 


MDS V 4 ms 0) Be 


DEY or c; (2,0) ax 
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Since the denominator is not zero in a neighborhood of \ = 0, b = b(A) 
is analytic there and can be expanded in powers of \. Up to terms of 
order \”*” this expansion is : 
, / 
A(0) [A (0)] 
4: ay (ee — A”(0)B(0) 
2[A (0) 


_ A'(0)A(0)B'(0) — An RO) 
[A (0)}° 


ot 0 Crs): 
The coefficients in the first two terms can be obtained by the following 
calculations: 


A(0) > >» €o(2,0) ae 


1zl=o 
> €o(0,0) ao 
— Qo , 


B(O) = pe Cn (2,0) Be 


zlan 


= » Cn(X,0) Bx 


|zi=n 


1 
= peal T2Bx 3 
Ns rely 


A'(0) = >t a(z,0)az 


jz} =1 


= Ds (2,0) ae 


J2|=1 


= De ToxQz 3 


weLy 


BY(0) = Dd. enas(2,0) 8: 


jel=>n+1 


— ye Cn41( 2,0) Bx + pe Cn41(X,0) Bz 


veLnsy 


: >) Tbe + 2 €n41(2,0) Be « 


Gc 1) tae 


The constants {¢x41(x,0), |x| = &} can be determined by the following 
recurrence, obtained from Theorem 10: 
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(0 0) = 0, 


Cn4i(2,0) = > tet Dd Crnly,0) tye — satel 


a a 1)! ! yeAx yeBy 

Our results can be put in a slightly more explicit form by expanding 
log b rather than b, and using the fact that A(-) and B(-), as defined 
by (3), are generating functions. We have 


log b = n log \ + log B(A) — log AQ). 
Except for the systematic absence of factorials, the coefficients in the 


expansion of log B(\) are related to those in the expansion of B(\) as 
cumulants are to moments. Set 


by = De eel), FH OL... 
a; = 24 Ci( 0) ae pee, india 
so that : 
BQ) = oy db; , 
A(A) = >» a; « 
Then, by a standard formula (Riordan,” p. 37), 
log BIA) = oy 72;(b), 
log A(A) = = Nk;(a), 
where for u 2 a or b (sequences) 
MES or 2s 


(ki)! +++ (kn)! ? 


with k = ky + kp + --: + k,, and the sum over all partitions of n, 
i.e., all solutions in nonnegative integers of ky + 2ke + +--+ + nk, = n. 


XIV. COMBINATORY INTERPRETATION AND CALCULATION OF THE CON- 
STANTS {Cm(z,0), c¢ S,m = 0} 


We shall now evaluate the coefficients in the power series expansion of 
p around \ = 0 explicitly as sums of products on paths in S. Additional 
combinatory notions that enter this calculation are discussed first. 
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A path is said to contain a loop if it returns (one or more times) to a 
place where it has been previously. Thus 7 = {a,-°-- , x} has a loop 
if there are integerszandj,0 S17 <j7 Sl, such that 


Uy = Uj. 
A circuit is a path that ends where it begins. Thus the generic path 


w= {Xo,..., 2} isa circuit if v% = x. A circuit is a loop if it contains 
no subcircuits, i.e., 7 is a loop if x = x,and 2 ¥ 7 implies 


Le ZL; 


foranyO $2: Sland0 <j <l. 

A eireuit of length two is of necessity a loop. Furthermore, the cir- 
cuits of length two on S can be partitioned into two classes according 
to the direction in which they are traversed. Each such loop can be 
thought of as obtained either by first adding a new call and then re- 
moving that same call, or by removing a call and then replacing that 
same call. Thus if r = {20 , 21, 2} is a loop of length two on S, then 


XY = Xe 
and either 

v2 = XY € B., 
or 

Le = Xp € Az, 


and, of course, not both. In the first instance we say that 7 is of the 
first kind. Thus a loop of the first kind is a path a of the form {z,y,2} 
with y ¢ Az, i.e., it is a trajectory in S obtained by starting at a state 2, 
adding a new call to go to a state y, and then removing that very same 
call to return to x. 

For a path w = {a,..., 27} and a state x we say that x is on wif x 
is one of a, 21, ..., 1. If x is on z, we say that a loop of the first 
kind on zw ends at « if for some 2 in the range 2 <7 S l, 


UzH= i2 = 72 
and 


that is, the subpath {x:2 , xi1, x. is a loop of the first kind, and 2; is x. 
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With 7 and x as above we define 


Valls, if x is on w and a loop of the first 
g(7,c) = kind on z ends at x 
1, otherwise, 


and we denote by v(z) the total number of loops of the first kind con- 
tained in 7. 
With these combinatory preliminaries behind us, we are ready to 


prove 
Theorem 12: The coefficients {¢m(x,0), x « S, m = O} are given by 
Cm(0,0) => Smo » 
and for x > 0, 
Cm 0 — —1 Sa, g(my) . 
aD) Pitre ae Il ly | (4) 


Tv 
U(r)=2m—|z| 


Before proving the theorem it is probably helpful to state it in words, 
thus: To calculate cm(x,0), consider all the paths 7 that are permitted 
by R, start at 0, never return to zero, end up at 2, and have length 
2m — |x| (ie., consist of m new calls and m — | zx | hangups); along 
each such path a take the product of the reciprocals of the numbers of 
calls in progress in the states traversed by z, omitting states at which a 
loop of the first kind ends; weight the product positive or negative 
according as 7 has an even or an odd number of loops of the first kind; 
add up all the weighted products. 

Proof of Theorem 12: We already know that ¢m(2,0) = 6mo and that 
Cm(x,0) = Oforz > Qandm < |x|. The latter result is consistent with 
Theorem 12 because no path from 0 can reach x in fewer than | zx | 


steps. Consider then the case m — |x| 2 0. Any path z from 0 to x 
of length 2m — |x| = m consists entirely of new calls, and for such 
am 
glay) _ 1 
ver |y| —[a{! 


The number of such paths, summed over in (4), is easily seen to be 7, , 
the number of permitted strictly ascending paths from 0 to x. Thus for 
m = |2a|, (4) states that 


Tr 
| x |! 





Cm(x,0) = 


as was proven in Theorem 9. 
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The remainder of the proof is by upward induction on m and finite 
downward induction on 2, using the recurrence formula 
| 2 | m(2,0) + s(x) ema(2,0) = 2, em(ys0) + 2X, Cma(Y,0) Tv 
year yeBy 


given by Theorem 10. The starting step of the induction is the fact that 
formula (4) holds for x « S and m such that 


0< |«|SmSt. 


This is a consequence of the fact, already proven, that (4) holds for 
|x| =m. 

Let us assume, as a hypothesis of induction, that the theorem holds 
for all x e S and m such that 


0< |«|Smsk. (5) 


We shall prove from this, by downward induction on 2, that it also 
holds for x e S and m such that 


0< |jex|Sm=k4+1. 


This last condition exactly describes the new cases covered in extend- 
ing (5) tok + 1. 

Where z is a path, we use the natural notation rz to denote the path 
obtained from a by adding x to 7 as a new ultimate element, assuming 
that 2 is adjacent to the last element of +. We now observe that if rx 
is a path that does not end in a loop of the first kind, then g(rv,x2) = 1, 
and so 








g(r2x,2) ee ee g (1,2) 
ere ie |ulzer Jz] ’ (6) 
v(x) = v(m.) 


A state x is maximal in the partial ordering S if no new calls can be 
put up in x, for whatever reason. If x is maximal, then A, is empty and 
s(w) = 0. A state x is maximal in (a set) X © Sifax = y for everyy ¢ X, 
and x eX. 

LetO S$ |va| S m= k + 1Landsuppose first that x is maximal. Then 


Cm(x,0) = rz] DS Cm—1(y,0) rye - 


yeBy 


No path ending at a maximal state x can end in a loop of the first kind, 
and any such path must have a y e B, with ry; = 1 as a penultimate 
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element if it is to be a permitted path. Then clearly 
fre PN Koz: U(r) = 2m — |x|} 
= Ulru:re PQ Ky 


yeBy 
and I(r) = 2m — |x|—1 and rye = 1}. 


Let now ye Bz, 7€P 1) Koy, Ur) = 2m — |x| — 1, andr, = 1. 
Then ra satisfies g(rv,7) = 1 and also formula (6). Thus formula (4) 
holds for maximal x and 0 < |x| S m= k +1, by the hypothesis of 
induction. 

Next, consider states x that are maximal in the set 


{ye S:0 Ss |2 |S 4+ 1}. 


These are just the elements of Lx41, i.e., the states x with | «| =’ + 1. 
Since we are assuming m = k + 1, the result (4) holds for these a by 
Theorem 10. 

Finally, assume as a hypothesis of downward induction (on |< | ) 
that the result is true for y « S and m such that 


1<jt+ils jy|[sSm=k 41, 
and suppose that | «|= 7. Then 


Cm (a 0) =o » lem(y,0) — fajlm ale: 0)] + =) ee Cn—1(Y,0) Pyz « 


5 yeAy 


If + is a path on S the notation pe(a) denotes the penultimate ele- 
ment of aw, ie., pe(r) = 214 for r = {2, %,..., 2. The notation 
ape(z) denotes the antepenultimate element of 7, 1.e., for r = {2, 21, 

zi}, ape(wr) = 22. 

A path of length 2m — |x| belonging to P M Koz reaches 2 either 
via A, or via B, . In the latter case the path cannot end in a loop of the 
first kind. By the hypothesis of induction, 


1 (=1 v(m) 

Hy Ove Re OS yy ht. 

| a | yeBy yeBy weP (\Koy i c: x | zer | vA | 
1(7)=2(m—1)—|y| 








(7) 
= 1)’ ee 
win, (OU 
l(r)= om—12| 
pe(m)eBy 


the second equality following from (6). 
Now consider a path rx of length 2m — ||, belonging to P M Koz, 
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reaching 2 via A, , and not ending in a loop of the first kind, i.e., with 
pe(a7) # x. Using the hypothesis of downward induction, we find 


ab > ys ~1)" a 





| yed weP{| K zen 
| | x oy 
iy. 
Tv 
( ) (8) 
z 
= = fy” J T5€) Ty ; 
Be ir 
pe(m)eAyz 
ape(r)Az 


the second equality again following from (6). 

Finally, we consider those paths of length 2m — |2 | belonging to 
P 1) Koz which do end in a loop of the first kind. Such a path is of the 
form 














YX 
with t ¢ PN Koz, Ur) = 2m — |x| — 2, andr, = 1, for some 
y « A,. We observe that in this case |y| = |x| -+ 1 and that 
eu | g(ry,2) ata) oo 1 g (7,2) 
Aas eee ’ ae s 5) go a 1 PEEL 
rete Ter tery = tert YT 
_ =)" 7 gla) 
ly| zee [2| 
= = (— Lye eae g(myx,y) (9) 
yl 
gl ryx,e) jy 
le] tet 2] 


=5 = 6 ae ll g(myzz) 


Zenyr hele = z | 


We note that v(ryx) = v(7) + 1, and that »(ry) = v(m). By summing 
formula (9) over paths of length 2m = |x| — 2 belonging to P N Ko 
and over y « Az such that r,, = 1, we obtain 


ds... > + \ aay II g(ary,z) 





| x | yea, AA bie zeny er 
Tey=l U(r)= -\z :|—2 
10) 
vr ie " v(m) % : ( 
ata Te ~ (-1 terre : 
ze€T weP {\Kox zen 

U(r)=2m—|z| 

pe(m)eAy 


ape(m)=z 
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By the hypothesis of induction 
1 VT 
3 > 4) 1? Sor aCeo): (411) 


[a] v | yeAy weP (\ Koz zer z wv | a | yeAg 
Pry=l U(4)=2m—2—|2| 


Also, it can be seen that for y « Az 

fre PO) Ky: Ur) = 2m — |x| —1 and pe(r) #2} 
Utary:re PO Koz, te = 1, and I(r) = 2m — |a] — 2} (12) 
= {re PN Ky: U(r) = 2m — |y |}. 


Combining (8) and (10), using (11) and (12), and applying the 
hypothesis of downward induction, we find 


> 1" TI ae 


weP (\Kox ZeT 
ine 2m || 


pe(m)eAy 
1 VT 5) 
Se ie ae) 
| xv | yeAy ead Kou zen | zZ “Jel ara xv | yeAy 
= = » {¢m(y,0) = Prylal 00) }. 
| v | year 


Together, (7) and (13) complete the inductive step. 


XV. CALCULATION OF Cm(x,\) 


In order to give an explicit formula for cn(x,\) for \ > 0 we suppose 
that, for each path zw on S, all the upward (in <) transitions on 7 are 
of two kinds, denoted by the symbols \ and e. (The calculation we 
present is more easily understood if the new calls labeled ¢ are thought 
of as due to the increment e in calling rate, while those labeled \ are due 
to the original calling rate \.) In other words, we consider the set of all 
paths z on S as (partially) labeled by assigning either or « to each 
upward transition. Formally, we define a labeling \(-) of a path 7 = 
{Xo , 1, ..., Lz} to be any function defined for 2, 2, ..., «, with the 
property 


eae A if aw Covers Xi, 
~ (0 otherwise. 


The set of all possible labelings of a path aw is denoted by A(z), and 
membership therein by the notation \(-) « A(a). The functions d(-) 
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should not be confused with the constant X. A path a together with a 
labeling A(-) of x will be called a labeled path and denoted by (7,A(-)). 
The index e(,\(-)) of a labeled path is the number of times \(2) as- 
sumes the value ¢ for x e x. 

Let A(-) € AC) be a labeling of t = {a,21,..., 2X7}. The function 
h(a,d(-),°) is defined by the condition 


h(wd(+),2) = || + As(px) 
if x is on w and a loop of the first kind on zw ends at x, with the first 
(i.e., upward going) leg of the loop labeled ¢ by X(-), i.e., 
(pe) = «, 
and by 
h(mA(-),7) = 1 


in all other cases. 
For a path w and a labeling \(-) € A(w), the function ¢(7,A(-)) is 
defined by 


Cans {the number of loops of the first kind on x 
~ \Jabeled ¢ on the upward leg by )(-). 


Theorem 13: For} > 0 and m = 0 
Co(0,A) => Som 
and for x > 0, 


= ‘ AL h(arC-),y) 
Cn(2,h) = 2 yleDi2¢ FAG) ) Y) 
an csr oe LTE xs@y" 14) 
(2) €A (rr) 
e(a,A(+)) =m 
Note that by Theorem 5, with z = 0, formula (14) for m = 0 reduces, 
as it should, to 


_ pz(d) 
Co(2,A) Do(r) 7 


Also, formula (14) agrees with formula (4), Theorem 12, as X is allowed 
to approach zero, since in this limit only (2,\(-)) which are full of ’s 
contribute, with m = $(l(7) + |a|). 

Proof of Theorem 18: The values of co(x,\) and cn(0,\) are conse- 
quences of the definition 


on _ pe(d\ + e) 
mu €"'Cm(t,) = Sr urUrE (15) 
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The equilibrium condition Qp = 0 for traffic parameter \ + € comprises 
exactly the equations 


[]a| + (A + €)s(x)]pe(d + €) 
= 2, Pid +) + + €) 2, Puld + e)ae ve 8. 


Dividing by po(A + €) > 0, substituting the expansion (15), and col- 
lecting coefficients of like powers of e, we find that c,,(v7,\) for ce S 
satisfies the equation 
[|e] + As(x)Jen(w,r) = a [Cm(Y,A) — TryCm—a(x,))] 
yeAy 
(16) 
+ 2, Tuelema(yrr) + dOm(Yd)]. 
YeBy 

It can be verified, using the fact that for any labeling \(-) of a path 

a the number of times A(«) has the value \ for x ¢€ 7 is 


iar) + | a | 
AmT IE — ea at-)), 


that formula (14) gives a formal solution of equations (16). 

To prove the theorem it suffices to show that the infinite sum over 
we P f) Koz in formula (14) is absolutely convergent, and that the 
left-hand side of formula (15), with the c,,(2,\) as given by the theorem 
is absolutely convergent for e small enough. 

We first observe that for each z and \(-) summed over in formula 
the factors h(w,A\(-),y) in 


h(wrA(-),y) 
ver | y | + As(y) 


are uniformly bounded, and that at most min(m,v(2)) of them are 
greater than unity. Also, the number of upward transitions (new calls) 
along a path w e PA Koz of length l(7) is just 


Mar) +{a| 
aaa eee 


Thus there are exactly 


ae ee 


ways of labeling the upward transitions on a path we Pf) Koz with 
length (7) and index m. 
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Hence for some constant a > 0 


l x 
| Cm (2 r) | < a” > @+leD/2 diel a oe . 
) — weéP (Koz m yer ly | + As(y) 


By Lemma 7, with {x; , 7 an integer} of the lemma defined in terms of 
the matrix Q appropriate to our congestion problem, 
y |x| + rs(x) do pete TT 1 


wreP (\Kox 


z>0 As(0) L(r)=k ver |¥| + As(y) 
=Pr{z; 40 for 1sSitSk|a =O}. 


By ex. 19, p. 378 of Feller,* there exists 0 < gq < 1 such that the proba- 
bility on the right is at most g" for allk = | S|. Hence 


> | em(2,d) | S const. a” >> (* \e 


x>0 k=m 


This proves that (14) converges absolutely, and that the left side of 
(15) converges absolutely for | «| small enough. 
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Spectral Characteristics of Digit- 
Simulating Speech Sounds 


By D. P. BORENSTEIN 


(Manuscript received July 11, 1963) 


A spectral analysis has been performed on a number of spoken vowel 
sounds, in particular those sounds causing digit registration in a TOUCH- 
TONE receiver. The analysis, implemented by computer methods, provides 
a definitive picture of the nature of digit simulation in TOUCH-TONE 
calling. 


I. INTRODUCTION 


A digit simulation in TOUCH-TONE calling (Ref. 1, pp. 9-12, 15-16) 
is, by practical definition, a speech segment capable of causing digit 
registration in a TOUCH-TONE signaling system. Spectral analyses 
have been performed on a number of speech segments, each of which 
was selected solely on the basis of having the above property. Briefly. 
a valid TOUCH-TONE ‘signal requires the simultaneous presence of 
two code frequencies for a certain minimum length of time, and with 
some minimum signal-to-noise ratio. It was therefore theoretically an- 
ticipated (Ref. 1, pp. 10-12) that each of these speech segments would 
be linked by two other common characteristics: (1) a frequency spectrum 
having two sharply dominant peaks, and (2) a high degree of periodicity 
for some minimal length of time. 

There is good reason to believe that speech segments of this general 
nature are likely to be troublesome in any signaling system based on the 
transmission of voiceband tones over speech channels. 

Due to the inherent rarity and relatively brief’ duration of the voice- 
produced digit simulation, some special procedures were required both 
in obtaining and analyzing these speech segments. The remainder of this 
article comprises a description of these procedures, followed by a 
presentation and discussion of the resulting spectral analyses. 
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Fig. 1 — Apparatus for recording digit-simulating speech segments. 


II, COLLECTING THE SPEECH SAMPLES 


The digit-simulating speech segments were obtained by recording 
raw speech onto magnetic tape loops with the two-track recording 
arrangement shown in Fig. 1. 

Using a 60-inch loop of tape at a speed of 15 in/sec, speech is continu- 
ously recorded at point A, played into a standard TOUCH-TONE re- 
ceiver at point B, and, if there is no receiver output, erased at point Ei 
after traversal of the loop. Simultaneously, on a second track, a 10-ke 
pilot frequency is continuously recorded and erased at points B and C, 
respectively. If at any time there is a TOUCH-TONE receiver output, 
indicating the presence of a digit-simulating speech segment just past 
point B, the timing network is triggered. The timing network then 
performs two operations: (1) it disables the 10-ke record and erase after 
a delay of 35 ms, and (2) it stops the tape transport after a delay of 2 
seconds (half the loop traversal time). 

This process yields a 60-inch length of tape consisting of about 29 
inches each of pre- and post-simulation speech plus a 1.5-inch (110 ms at 
15 in/sec) segment which contains both the actual simulating speech 
sample and the 10-ke pilot frequency. In this manner, fourteen such 
samples were obtained, at the average rate of about one per ten hours of 
raw speech — an indication of the extreme rarity of simulation with the 
present TOUCH-TONE receiver. 


II. ANALOG-TO-DIGITAL CONVERSION AND PRINTOUT 


By means of encoding equipment developed by the Acoustics Research 
Department, the fourteen digit-simulating speech segments were con- 
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verted from analog form to an eleven-bit digital signal. The sampling 
rate of 10 ke was gated directly from the pilot track of the original analog 
tape, thus eliminating sources of error due to tape flutter during the 
original recording process. Once the digital tape was obtained, the 
conversion process was reversed to obtain an accurate X-Y recording of 
each of the fourteen speech waveforms. Visual inspection of these 
waveforms, two of which are shown in Tig. 2, confirms their periodic 
nature (the periodicity of the samples shown in Fig. 2 would be still 
more evident were it not for the fact that most speech fundamentals are 
considerably attenuated by telephone apparatus). 


IV. SPECTRAL ANALYSIS 


The fourteen speech samples, in eleven-bit digital format, were then 
subjected to a “pitch synchronous’” Fourier analysis on the IBM-7090 
computer. The pitch synchronous analysis consisted essentially of a 
conventional Fourier analysis performed on each successive funda- 
mental pitch period in the speech sample. These pitch periods, in turn, 
were determined on the computer by counting the number of sampling 
intervals (each being 100 usec) between successive maxima in the wave- 
form and then interpolating between samples for greater accuracy. This 
method of lourier analysis is ideally suited to waveforms that maintain 
an almost-periodic structure over an appreciable length of time. 

Tor each speech segment analyzed, the computer output consisted of 
a sequential set of bar graphs, one for each fundamental pitch period of 
the speech waveform. Each graph, in turn, is a plot of harmonic ampli- 
tude (the Euler coefficient) in db versus harmonic number. In addition, 
each graph gives the “instantaneous pitch” (i.e. the reciprocal of the 
period) of each fundamental period analyzed. Figs. 3 and 4 show the 
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Fig. 2 — Analog waveformsof two digit-simulating speech segments. Also shown 
are the sex of each speaker and the particular phoneme causing the simulation. 
Fourier spectra of digit simulations 1 and 2 are shown in Figs. 3 and 4, respectively. 
Arrows indicate periodicity, with the large arrow showing the approximate start 
of the digital simulation. 
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Fig. 3 — Set of Fourier spectra for digit simulation No. 1 (as shown in Fig. 2). 
Each X represents a 1-db relative amplitude increment. Spectra are in alpha- 
betical order,with respect to time. 
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Fig. 3 — (continued) 


two sets of spectra corresponding to the two speech segments whose 
time domain waveforms appear in Fig. 2. 


V. DISCUSSION OF RESULTS 


Several aspects of the spectra shown in Figs. 3 and 4 are worthy of 
note. 

First of all, it is seen that these two speech segments (as well as the 
twelve others not shown here) do indeed satisfy the two properties 
anticipated in the introduction. The high degree of periodicity of these 
speech waveforms is spectrally confirmed by noting that in both se- 
quences of spectra the harmonic structure remains extraordinarily 
uniform. (This result also confirms, by hindsight, the original validity 
of a period-by-period Fourier analysis.) By noting the fundamental 
pitch (thus the period) of each segment, it is seen that this highly stable 
harmonic structure is maintained for at least the 28 milliseconds which 
coincides with the duration requirements of the TOUCH-TONE re- 
ceiver. 
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Fig. 4— Set of Fourier spectra for digit simulation No. 2 (as shown in Fig. 2). 
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Secondly, one finds immediate justification for the fact that these 
speech segments caused digit simulation. By multiplying the funda- 
mental pitch of any segment by the orders of its two dominant har- 
monics, a valid TOUCH-TONE calling signal* is derived. Thus a voice- 
produced digit simulation is spectrally analogous to a valid TOUCH- 
TONE signal accompanied by noise, with the sole exception that in the 
former case both the “noise” and signal components are integral mul- 
tiples of a discrete fundamental frequency. Indeed, this sole distinction 
between a digit simulation and a valid signal might possibly be used to 
provide further simulation protection in future voice-frequency signal- 
ing applications. Specifically, a receiver might be designed to be sensitive 
to the presence of selected harmonics and/or sub-harmonics of valid 
signal frequencies, and thereby to reject many speech phonemes which 
would ordinarily cause simulation. 

In the portions of Figs. 3 and 4 where the harmonic structure is notice- 
ably changing with time (namely at the beginning and end of each series 
of spectra) pitch-synchronous Fourier analysis can be regarded as only 
an approximation of spectral density. For some applications, however, 
the approximation is still useful. In the first place, one can obtain a 
practical ‘feel’ for the rate of change of pitch and harmonic structure 
in vowel-type speech sounds. Also, from the standpoint of digit simula- 
tion, by examining the spectra one can ascertain just how and when a 
speech segment becomes a digit simulation. For example, in the early 
spectra of Fig. 3, although pitch requirements for digit simulation are 
satisfied, the 10th harmonic competes with the 7th harmonic for limiter 
capture, and receiver recognition is prevented by limiter guard action 
(i.e., insufficient signal-to-noise ratio). (See Ref. 1, pp. 10-11, 13.) 

On the other hand, although early spectra of Fig. 4 show an acceptable 
harmonic structure for digit simulation, the pitch is slightly too low for 
receiver recognition. In a similar manner, one can determine how and 
when a digit simulating wave-form starts to degenerate. 

Admittedly, the speech segments chosen here are both rare and few in 
number. Thus, one cannot draw conclusions of statistical significance 
from this study. However, there is no reason to believe that any other 
group of frequencies of the same capacity in the voice band would not 
be simulated by the voice about as often as were these TOUCH-TONE 
calling frequencies. Therefore, such vowel-type speech segments may be 

* A valid TOUCH-TONE calling signal consists of one frequency from each of 


two groups: a low group — 697, 770, 852 and 941 cps +2.5 per cent — and a high 
group — 1209, 1336, 1477 and 1633 cps +2.5 per cent. 
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looked upon as potential digit simulations in almost any proposed voice- 
frequency signaling application. 
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Reflections from an Exponential 
Atmosphere 


By K. BULLINGTON 
(Manuscript received August 6, 1963) 


A quantitative explanation of tropospheric radio propagation is derived 
without the use of arbitrary numerical factors. It is based primarily on the 
average value and the standard deviation of the index of refraction at the 
earth’s surface, both of which decrease exponentially with height. 

This method seems to bridge the gap between the internal reflection and 
scatter hypotheses. The quantitative results are in good agreement with ex- 
perimental data on distance dependence, frequency dependence, climatic 
and seasonal variations, and effective antenna gain. One consequence is a 
new concept of the decrease in effective gain of narrow-beam antennas, which 
accounts for some hitherto unexplained experimental results. 


I. INTRODUCTION 


Several conflicting hypotheses have been proposed over the years in 
an attempt to explain the transhorizon radio field intensities, which in 
some cases are hundreds of decibels stronger than predicted by smooth 
earth diffraction theory.'-® The rapid fading ordinarily associated with 
this type of transmission Jed naturally to the concept of many scattered 
components with random phase angles. 

In order to account for the average field intensity, frequency depend- 
ence and other characteristics, the scatter hypotheses must make a num- 
ber of more or Jess arbitrary assumptions regarding the number, size and 
distribution of the irregularities and whether the scatterers are spheri- 
cal “‘blobs”’ or flat ‘“‘pancakes.’”’ The scatter concept is a statistical frame- 
work that can be adjusted by arbitrary parameters to fit almost any con- 
sistent experimental data and, in fact, the values of the parameters 
have been changed significantly when the need to accommodate unex- 
pected experimental results has arisen. 

On the other hand, Snell’s law of reflection indicates that reflections 
must occur from the gradual decrease in dielectric permittivity with 
height. This concept focuses attention on the average received power but, 
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of course, it is recognized that the fading must be associated with the 
motion of the irregularities in the atmosphere. The object of the internal 
reflection hypothesis is to relate the average received field to the rate 
of decrease in atmospheric density, and hence to eliminate some of the 
arbitrary parameters required by the scattering concepts. Many theorists 
have concluded that the internal reflections cancel, or at least that their 
effect is too small to be important. 

If the problem of inhomogeneous atmosphere could be solved rigor- 
ously, there would be no difference of opinion, but a rigorous solution 
that can be readily evaluated has not yet been achieved without signifi- 
cant approximations, and the relative accuracy of the different approxi- 
mations is in question. 

The purpose of this paper is to present a new approach that to the 
present author seems to bridge the gap between the internal reflection 
concept and the scatter hypotheses. Although the method is not as 
rigorous as would be desired, it does provide quantitative predictions 
that are in excellent agreement with experimental data without the use 
of arbitrary numerical factors. 


II. METHOD 


The procedure is to find first the magnitude of the internal reflection 
from a smooth exponential atmosphere and then to superimpose ran- 
dom variations on the exponential decrease. 

The difference in dielectric permittivity « between two horizontal 
planes at heights y, and y2 is assumed to be a random variable with a 
Gaussian distribution. The average value depends on the average index 
of refraction -at the surface, together with the exponential decrease in 
atmospheric density with height; the variance at any height depends on 
the variance in the surface index of refraction together with the same 
exponential, decrease with height. The assumed rate of exponential de- 
crease or scale factor cuts in half the atmospheric density, mean value 
of permittivity and-variance in permittivity for each increase in eleva- 
tion of 7 or 8 kilometers. These quantities are based on meteorological 
data and are independent of transhorizon radio experiments. 

Jt is further assumed that the reflection coefficient is proportional to 
the change in dielectric permittivity and hence is also Gaussian dis- 
tributed. The average reflection coefficient is derived from electromag- 
netic theory, as a function of the average exponential decrease in per- 
mittivity. 
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Finally, it is assumed that the variance of the reflection coefficient is 
related to the variance of the dielectric permittivity by the same func- 
tion that relates the average reflection coefficient to the average permit- 
tivity. 

The resulting reflection coefficient is Gaussian distributed and has 
both positive and negative values, which are almost equally probable 
when the standard derivation o is large compared with the mean value 
uy. For comparison with experimental data it is necessary to find the mean 
of a Gaussian distribution taken without regard to sign, and this is given 
with sufficient accuracy by either the mean value uw or ~/2/20, which- 
ever is the larger. The former is usually controlling below 100 me and the 
latter is usually controlling at UHF and above. 

This paper is not intended to be a complete description of all tropo- 
spheric phenomena. Some variations exist in the horizontal plane but 
they are small compared with the variations in the vertical plane and 
have been neglected in order to simplify the presentation. The resulting 
modified Gaussian distribution is similar to a Rayleigh distribution; 
consequently, this derivation has a fading component, but it does not 
give quantitative results on either the speed of fading or on the associ- 
ated problem of useful bandwidth. There is a hint that the speed of fad- 
ing depends on the number of Fresnel zones in the common volume and 
that the useful bandwidth varies inversely as this quantity, but further 
work is needed. 

In addition, this paper is concerned only with the effects of the at- 
mosphere that have been neglected in the smooth sphere diffraction 
theory. The latter will be controlling at short distances and low fre- 
quencies. 


Ill. EFFECT OF SMOOTH EXPONENTIAL PROFILE 


The permittivity of the atmosphere is assumed to vary only in the 
vertical direction, and to have a mean value and a variance, both of 
which decrease exponentially with height. Specifically, the mean value 
of dielectric permittivity « at any height illustrated on Fig. 1 1s assumed 
to be 


e=et(l—.6)[l—- ee]. (1) 


It follows that the change in dielectric permittivity in the region between 
y = y, and y = y is given by 


Ae, = @—4@ = (1 —- ee OTT eve 4)) (2) 
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Fig. 1 — Geometry of transhorizon path. 


where e, = dielectric permittivity at the surface (y = ys) 
b = rate of exponential decrease 
=~ 0.14 per kilometer. 


The parameter b is primarily a measure of the rate of decrease in at- 
mospheric density, which is in turn determined by the laws of gas dif- 
fusion and the law of gravity. The average value of b varies from about 
b = 0.12 per km near the earth’s surface to about 6 = 0.16 per km at 
elevations greater than 7 or 8 kilometers, but it can be considered a 
constant in this application.* 


* The parameter b is related as follows to the widely used effective earth radius 
factor k (which is limited to regions where 1 — e~ = by) 
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For the typical case of N, = 314 and b = 0.125/km, the effective earth radius factor 
isk = 4/8. 
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When a change in dielectric permittivity occurs at a sharp boundary 
as shown on Fig. 2(a), the reflection coefficient g for either polarization 
can be derived from Snell’s law with the following result 


_ €2 — €& 
= 4 sin? a (3) 
or 
4o’q Ye 4 


where a is the angle between the ray and the boundary. This result as- 
sumes | 6 — «| < 4 sin” aand sin a & a. The corresponding reflection 
coefficient for an exponential decrease illustrated in Fig. 2(b) is derived 
in a later section with the following result 


b 


2 = _ Pe Seen . | 
4a'q = (€ a) paypal” (4) 
where 
_ dd _ 2a 
2rya 
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\ = wavelength 


as | Perl tA | 
: 1 — exp [—b(y2 — m)] 
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path distance 
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= antenna beamwidth a (yo — yr) 


M = number of pairs of Fresnel zones in the common volume 
formed by the intersection of the two antenna beams 
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Fig. 2 — Idealized variation in dielectric permittivity. 


The factor F; is unity whenever M is an integer; this is the minimum 
value and this is the case of interest.” The assumption of fractional 
Fresnel zones would lead to | F,| > 1, but realizable antenna patterns 
cannot cut off sharply enough to make this assumption meaningful. 
Nonexponential models of the atmosphere lead to factors analogous to 
I’, which are either zero or indeterminate, and presumably this is the 
basis for the widespread assumption that internal reflections cancel. 

Substitution of (2) into (4) leads to the following expression for the 
reflection coefficient of a smooth exponential atmosphere 


us EE 1) gat 
on foL-s)]} « 
{|e PPAR ees eas 


Ara = aby 


Ara 


* The use of pairs of Fresnel zones is an artifice introduced by Fresnel to solve 
related problems in optics. By this means, a difficult vector addition problem is 
changed into a much easier scalar one because the phase shift in each pair is 
exactly 360° by definition. A small percentage change in the number of pairs of 
zones cannot change the sum significantly, so the exact number of pairs is not 
critical. If, however, the summation is taken over a region that cuts off sharply 
at an odd or fractional zone, the sum depends critically on the exact nature of 
the cutoff. In this case, the magnitude of the vector sum oscillates over a rela- 
tively wide range and mathematically the summation is very difficult or indetermi- 
nate. 
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The term in the first brackets represents the reflection coefficient that 
would have been expected if all the decrease in dielectric permittivity 
had occurred at a sharp boundary at height y, = Da/2. The term in the 
second brackets is the effect of an exponential decrease for wide-beam 
antennas, (as long as this factor is less than unity). The third brackets 
show the effect of narrow-beam antennas, which depends on distance as 
well as beamwidth. As discussed above, the factor F'; can be set to unity. 
The term in the right-hand brackets can also be neglected for a rela- 
tively smooth earth path, because y;  2y, for this case. 

Typical values of the reflection coefficient ¢ computed from (5) are 
shown in Table I for wide-beam antennas; that is, only the first two 
bracketed terms are used. 

The estimated losses given in Table I are much closer to the meas- 
ured results than are those obtained from smooth earth diffraction 
theory but they are still 10-80 db greater than the median measured 
data. In addition, (5) is incomplete because it is not quite adequate in 
other respects. 





TABLE [ 
Distance beyond Line of Sight 20 log | qg |, decibels 
Qa 
km miles 300 me 1000 mc 3000 mc 
100 62 0.45° —49 —59 —69 
200 124 0.9° —68 —78 — 88 
500 310 2.25° —97 —107 —117 
1000 620 4.5° — 1383 —148 —153 





The experimental results on the variation of the received voltage with 
frequency seem, on the average, to be closer to ~/X rather than \ as 
shown in (5). Seasonal and climatic effects are known to be significant, 
but the only possible variable in (5) is (e; ~ 1), and the seasonal 
changes in (e, — 1) are too small to account for the observed results. 
Finally, the fixed exponential atmosphere assumed in (5) cannot hope 
to explain the fading phenomena unless the variations in the atmosphere 
are considered. 


IV. EFFECT OF ATMOSPHERIC VARIATIONS 


Random fluctuations in atmospheric pressure, temperature and 
humidity are superimposed on the average value. The resulting varia- 
tions in the index of refraction (n = ~/e) at the earth’s surface have 
been measured in many experiments, but perhaps the most complete 
and reliable results can be obtained from long-term weather data.”” 
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An analysis of meteorological data for 8 years from 45 U.S. weather 
stations indicates that the standard deviation of the surface index of 
refraction has a long-term average in the vicinity of Washington, D.C. 
of 10-15 N units, which corresponds to a variation in e, of a(e,) 20 to 
30 X 10°°. In the northern part of the U.S., o(e,) may be as low as 
4 X 10° (2 N units) and along the Gulf and in Florida it may be as 
high as 52 X 10° (26 N units) in certain hours and seasons. The vari- 
ance at the earth’s surface can be defined as 


variance (e,) = [o(e)]° = [6(es — 1)f. (6) 


Since (e; — 1) is of the order of 600 X 10°, the average value of 10 to 
15 N units corresponds to 6 = 3 to 5 per cent at the surface. 

At very great heights the variance of « must approach zero. If the 
dielectric permittivity followed a smooth exponential decrease and e, 
were the only variable, the standard deviation « would decrease at the 
same rate as the mean value given in (2), but this does not agree with 
the experiment. Experimental data based on meteorological measure- 
ments indicate that the standard deviation o decreases more slowly than 
the mean value.” A better fit with experimental data can be obtained by 
assuming that the variance (c’) and not the standard deviation o de- 
creases at the same rate as the mean value. Quantitatively this means 
that the standard deviation of the dielectric permittivity, which 
is measured to be 3 to 5 per cent of the mean value at the surface, is 
taken to be about 8 per cent of the mean value at an elevation of 14 
kilometers, 16 per cent at 28 kilometers, 32 per cent at 42 kilometers, 
etc. At higher elevations this model breaks down because the standard 
deviation cannot be greater than the mean value in a nonionized me- 
dium. Heights greater than 50 kilometers are approaching the ionosphere 
and, hence, are outside the usual range of interest for tropospheric trans- 
mission. On this basis, the variance at y; over the interval y. — y; Is 
given by 

variance (e — «) = [8(es — 1) Pe [1 — 2] (7) 


or 


o(€2,1) = d(e, — 1) exp (nw) V1 — ew), 

The dielectric permittivity of the atmosphere, as used in this paper, 
has now been specified to be uniform in the horizontal plane and to have 
an exponential variation in the vertical coordinate with a mean value 
given by (2) and a variance given by (7). The mean value for the re- 
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flection coefficient corresponding to (2) has been given in (4) and (5). 
Inasmuch as the irregularities superimposed on the exponential decrease 
cause the variance of the dielectric permittivity to vary as its mean, it is 
assumed that the variance of the reflection coefficient is also proportional 
to its mean value. This critical assumption leads to 


b 


b+ maf 


variance [4a’q] = variance (e — «) 


which by means of (7) leads to 


AGE: EE = 1) ae D ./,— nin a F 


4a? 4ra 
/eaf (3 -¥)] 
abr 
VV ae 

The interpretation of (8) is essentially the same as for (5). The first 
two terms to the right of the equality sign are the important ones for 
wide-angle antennas, the third of the five factors is the correction for 
narrow-beam antennas and the two factors on the right are unity in 
most applications and hence can ordinarily be neglected. 

When the standard deviation (8) is comparable to or larger than the 
mean value (5), the resulting distribution of reflection coefficients con- 
tains both positive and negative values. The significance of a change in 
sign is to introduce an uncertainty in phase delay of one half cycle, which 
is unmeasurable and unimportant in most radio applications and tests. 
In order to compare with experimental data it is necessary to “fold over” 
the negative part of the distribution, add it to the positive part and 
then to find the mean value of the resulting distribution. 

The mean value of the folded distribution approaches the mean value 
uw of the Gaussian distribution when ¢ < p» and approaches ~/ 2/70 
when o > uw. More precisely, the mean value of the folded distribution 
is given by 

uw (folded distribution) = / 2 Pe ak aes © perf (5) 
T oV/2 
where yw and o refer to the original Gaussian distribution. 

The folded distribution indicates that fading is to be expected, at least 
on a diurnal and seasonal basis. The use of meteorological data taken at 
hourly intervals does not provide quantitative information on the fast 


2858 THE BELL SYSTEM TECHNICAL JOURNAL, NOVEMBER 1963 


fading, but it does not preclude it. Moreover, the fading can be larger 
and at a faster rate than indicated by the variance in the dielectric 
permittivity taken over the entire vertical height of the common volume. 
The fading depends on the variations within each pair of Fresnel zones 
rather than on the standard deviation of the entire region. The standard 
deviation of the variation within each pair of zones is of the order of 
4/ M times the standard deviation of their sum, where M is the number 
of pairs of Fresnel zones. 

In addition to the reflection coefficient of the atmosphere, the trans- 
mission loss between two ground-based antennas relative to free space 
is also affected by ground reflections near each terminal. Ground reflec- 
tions introduce image antennas which in effect double the amplitude of 
the received field intensity. This factor of 2 (or more precisely 1 + |r |, 
where r is the reflection coefficient of the ground) appears in plane-earth, 
smooth-sphere and scatter theories, and it is unimportant numerically 
whether it is considered as twice the free-space field incident on the 
atmosphere with no effect near the receiving terminal or considered to be 
a power addition of the antenna and its image at both terminals. 

Consequently, the mean value of transmission loss (relative to free 
space) to be compared with experimental data is the larger of 2¢ from 
(5) or 2+/2/xo from (8). At frequencies above 300 me the latter is 
controlling, and typical values are shown in Table IT for 6(e, — 1) = 
25 X 10°, which corresponds to a standard deviation at the earth’s 
surface of 12.5 N units. 

Tor comparison with the scatter hypothesis, the first two terms in 
(8) lead to the following ratio for the average received power relative 
to free space for wide-beam antennas 

2 —bDa/4 
i = 2\/2/rol’ = brlo(es)Te 


870° (9) 


V. INTERPRETATION OF RESULTS 


The computed transmission losses shown in the above table for wide- 
beam antennas are compared on [ig. 3 with the average experimental 














TABLE II 
Distance Transmission Loss Rae to Free Space, 
km miles 300 mc 1000 mc 3000 mc 
100 62 0.45° 44 49 54 
200 124 0.9° 60 65 70 
500 310 2.25° 82 87 92 


1000 620 4.5° 106 111 116 
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Fig. 3 — Transhorizon transmission loss for widebeam antenna. 


data as given in a recent CCIR Study Group report. Since the computed 
values are based on the distance between the two horizons, while the 
experimental data are based on total path length, the computed values 
have been plotted on Fig. 3 at a distance of 30 km greater than given in 
the above table in order to allow for the line-of-sight distance between 
each terminal and its horizon. This assumption corresponds to an antenna 
height of about 50 feet over smooth earth. Any other reasonable assump- 
tion is not likely to change the comparison significantly. 

The computed values are somewhat lower than the average experi- 
mental data as given in the CCIR Study Group Report, but at least 
part of the difference (in decibels) may be due to the assumed value of 
antenna to medium coupling loss used in the analysis of the original ex- 
perimetal data. As shown in the following paragraph, the effect of 
narrow beam antennas derived herein is ordinarily less than obtained 
from the method used in the CCIR Study Group Report.” 

The antenna degradation factor given in (8) is 

VI = eR = V/T = envy, 
and is evaluated in Table III. 

The db loss shown in the second column of Table III depends on the 
height of the common volume (y2 — y:). When the path is symmetrical 
and the two antennas have equal beamwidth, the corresponding path 
length for a given loss and beamwidth is shown by the right-hand col- 
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TasieE III 


Deckeave innicetive Distances for Various Beamwidths 


(y2 — m1) Antenna Gain 
(Both Antennas) 


B= 2 p=1° B = 0.5° 
300 meters 13.9 db 17 km 34 km 69 km 
1000 8.9 58 115 230 
3000 4.7 172 345 690 
10,000 1.2 575 1150 2300 
20,000 0.27 1150 — — 


umns. These results are compared on Tig. 4 with typical values of the 
antenna-to-medium coupling based on scatter concepts for a 1° antenna. 
Although most of the experimental data are near the crossover region, 
the opposing trends provide a good basis for a critical experiment. 

A previously unexplained experimental result at 400 me on a 1000-km 
path is that antennas up to 120 feet in diameter (beamwidths greater 
than 11°) showed less than 1 db loss in effective antenna gain when 
measured at one terminal only.“ This result agrees with the present 
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Fig. 4— Loss in effective antenna gain. 
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method but is in sharp disagreement with earlier predictions. At the other 
extreme, measurements at 9000 mc on a 46-mile path with 4-foot and 
1-foot antennas (about 1.6° and 6.6°) showed a loss of about 9 db when 
a change in antenna was made at both terminals. The data on Fig. 4 
would indicate no more than 6 db for a 4-to-1 change at short distances, 
but is closer to the measured result than the earlier methods which in- 
dicate 2 to 4 db. 

Long-term comparisons between 8-foot and 60-foot antennas at 4000 
me (1.9° and 0.25°) on a 171-mile path indicated about 12 db loss in an- 
tenna effectiveness.!® On the average the signal received on the 60-foot 
antenna was 5.7 db greater than on the 8-foot antenna, rather than 17.5 
db, as would be expected in free space. The resulting loss of 11.8 db com- 
pares with an expected value of 10 log 6 = 7.8 db, since the path profile 
given in the reference indicates that the height of the common volume 
was reduced by a factor of 6. 

The conclusion on the loss in effective antenna gain is that although 
the data on Fig. 4 seem a few db too low, the general shape of decreasing 
loss with increasing distance is to be preferred to the opposite trend de- 
rived from scatter theory. 

It will be noted that the voltage ratio given in (8) varies as Vx, 
which is in general agreement with experiment, and that variations in 
5(e. — 1) can be 20 db or more with variations in climate, season and 
time of day. At UHF and above, the standard deviation (8) is large com- 
pared with the mean value (5), so considerable variation with time is to 
be expected; in the lower VHF and below, the standard deviation for 
the usual distances is less than the mean value, and this is consistent 
with the reduced amplitude of fading at the lower frequencies. 

The remainder of this paper outlines the derivation of the reflection 
coefficient of a smooth exponential atmosphere, which was stated without 
proof in (4) and (5). 


VI. DERIVATION OF REFLECTION FROM A SMOOTH EXPONENTIAL ATMOS- 
PHERE 


Some reflections must occur whenever the dielectric permittivity 
changes. When the change occurs at a sharp boundary, the reflection 
coefficient is proportional to the difference in dielectric permittivity; 
when the dielectric permittivity is constant with height, the reflection is 
zero. The problem is to find the reflection coefficient for the intermediate 
case of a gradual change in permittivity. 

A useful formulation of this problem was published by Schelkunoff in 
1951.° The geometry and three possible variations are indicated on Fig 
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Fig. 5 — Idealized geometry for reflection from stratified media. 


5. It is assumed that the region in which the dielectric permittivity is 
changing is sandwiched between two uniform media of dielectric permit- 
tivity « and e , respectively. The reflection coefficient q of the stratified 
medium is derived from wave equations with the following result 


= af aloe K(#) .-%9 ag 


where 


do 


=5 "1 dK 


ote 
oma ee 


K = 4/7 (¢~ 6 + sina) 








dk & o 
2 sin? a 
Qrya 2rHa 
=S= OO _—_- Si A 2 g = 
g Xr Yy; nN 


= AH 


\ = wavelength in same units as height y 


H = y2— Yi. 
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It follows that for (e — «) <sin’ a 


wy) fi de ie dd (10) 
ied 4 sin? a /o dd . 


Yor a linear variation corresponding to 
? 
= at x (a — 4) 


the reflection coefficient reduces to 


sin 0 —49 _ (e — 4) 
es € ) R ——— “A sin? a. . 
sin? a 


The factor R is the reflection coefficient expected for a sharp boundary 
with the same total change in dielectric permittivity, and 6 is the total 
phase change in the inhomogeneous media. 

In a similar manner, the reflection coefficient for the antisymmetrical 
cubic illustrated by case 2 is given by 


q = Ro (ane cos 0) 


The above examples for linear and S shaped curves are given by 
Schelkunoff. The same general method has been used more recently by 
du Castel, Misme and Voge to obtain the reflection coefficient for the 
additional cases of parabolic, sinusoidal and hyperbolic variations.” 
Their results can be summarized as follows: 











linear |q| = Ra (11) 
sinusoidal |q| = R — “ (12) 
: sin 0\’ 
parabolic |g| = R 7 (13) 
6 


hyperbolic | q| = R (14) 


sinh 6° 
In each case, as 6 approaches zero, the reflection coefficient approaches 
the value given by Snell’s law for an abrupt change. 

The effect of a gradual decrease in dielectric permittivity is to reduce 
the reflection coefficient below what would have been expected if all of 
the change occurred at a sharp boundary. The reduction in reflection 
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coefficient depends on the slope of transition and not on the boundary 
conditions alone. For the linear case it varies as 1/6 and for the hyper- 
bolic case it ultimately decreases exponentially. As @ approaches infinity 
the reflection coefficient goes to zero for these particular cases, but before 
reaching the conclusion that g always goes to zero as the upper limit goes 
to infinity, it is instructive to formulate the general case. . 

In general, the dielectric permittivity « can be represented by 


= €1 + m/(e2 _ €) 


where the parameter m can be any function that varies from m = 0 at 
the lower boundary (e = «,) to m = 1 at the upper boundary (e€ = e&). 
The corresponding reflection coefficient is 
6 
q=R ; on ee dd. (15) 
An exponential decrease with height provides a better match to the 
average atmospheric variation than is possible with any of the above 
models. Consequently, it is instructive to consider the case of m = 1 — 
e ’” where b is the rate of decrease in atmospheric density. The magni- 
tude of b is about 0.14 per km and is determined by the characteristics 
of gases and the law of gravity. For the case of a smooth exponential de- 
crease, the reflection coefficient given by (10) or (15) reduces to 


8 
R I be e °* dd (16) 


a 
I] 
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H 
R | be Oty dy 
0 


__; b +i) 
= pepo 
ee ee be b+ i2A) HH 
“aa pasa (17) 
where 
6= AH 
o = Ay. 


It will be noted that the upper limit H appears only in the exponential 
factor and that the term in brackets approaches unity as H — «. The 
reflection coefficient for a smooth exponential atmosphere is the reflec- 
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tion coefficient R that would result if all the change in dielectric permit- 
tivity occurred at a sharp boundary, multiplied by b/(b + 722A). In 
the atmosphere, b < A for VHF and above, as long as a is greater than 
0.01 degree. 

The bilinear model (uniform decrease with increasing height until 
« = 1) was used extensively in earlier studies, but its usefulness was 
questioned because of the discontinuity at the upper boundary H = 1/b. 
In the present nomenclature the reflection coefficient for the bilinear 
model with a slope b is 


q= R (1 ae gah ; (18) 


b 
A 
This is equivalent to the derivation of equation (2) in Ref. 2. The similar- 
ity between (17) and (18) shows that the use of the exponential decrease 
removes the effect of the upper discontinuity without any significant 
change in the magnitude of the reflection coefficient. 

The foregoing analysis has been based on a rectangular coordinate 
system centered on the lower boundary to facilitate comparison with 
earlier papers. It is now desirable to shift to the geometry illustrated in 
Tig. 1. In this case 


as e Clu ¥s) 
e=eg+fi—e° [ll — al (19) 
=1+(e—1le°™ 
de = —(e — 1)be "dy (20) 


where e, is the dielectric permittivity at the surface. 
Substituting (20) into (10), 


q= _ fe — 1) 1) ells b [re —(b+72A)y dy do 
“4 sin? a Ani dd 
Ce, = 1) bie _ 12A)4 
gene b fOr ay 


which reduces to 


_ fe 1) eo Pu1-us) —i2Ay, b —(b+124) (yo—-y1) 
6 ‘ t 1 a ay) 21 
- 4 sin? : b+ 272A : fe 20) 


From the geometry in Fig. 1 it can be seen that for the symmetrical 
case the grazing angle a % 2y,/D and the antenna beamwidth 6 Y 
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(2/D) (ye — y1). In addition, each successive Fresnel zone corresponds 
to a change in phase of w radians, so y2 — y; = Ma/A, where M is the 
number of pairs of Fresnel zones. With these substitutions (21) can be 
rearranged as follows 


= aD gsm by — comm [t= etme oT 


4sin? a * 2A 1 — e-*pb/2 
v0 (=F =») 
— A ae (22) 
3A 


This equation has already been given by (5). The first bracket to the 
right of the equality sign is equivalent to equation (27) in Ref. 18 except 
that the latter is 2.24 times larger to account for the effect of the earth 
and for a gradual cutoff near the horizon. 

This completes the derivation of (5), but a better understanding of 
the physical problem can be obtained by introducing the factor (1 — 
e °"/4) in both numerator and denominator as follows 


a (és = 1) _ pvorla cel b | 
¢-|fe=% (1 Je b+ 724A 


ne —brM/A —i2Mqr 
[=e |. (93) 


1 — ela 


The terms inside the first brackets represent the magnitude of the 
reflection coefficient that would result if the change in dielectric permit- 
tivity within one pair of Fresnel zones (phase change of 360°) were 
concentrated at a sharp boundary. The middle bracket corrects. for a 
gradual change instead of a sharp boundary. The right-hand brackets 
represent the sum of J pairs of Fresnel zones and will be recognized as 
the sum of a geometric progression of MZ terms whose common ratio is 
et 4e-”™ The factor (1 — e¢ °”) is the fraction of the reflected energy 
that remains in each pair of Fresnel zones after the almost complete 
cancellation caused by opposite phase in two adjacent Fresnel zones. 

The concept of Fresnel zones is used to simplify the problem of adding 
a large number of components with a wide range of phases. By grouping 
into blocks of 27 radians, the contribution from each block adds in phase, 
and it is easy to see that the inclusion of weak components beyond nomi- 
nal boundaries does not cause the answer to disappear. The cutoff merely 
neglects this contribution and the true answer is slightly greater than the 
computations indicate. In other words, the effect of the ‘‘discontinuity”’ 
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at the lower limit of integration, which has proved troublesome in some 
formulations of the problem, seems to have a negligible effect on the 
magnitude of the signal returned from the atmosphere. 

Although the Fresnel zone concept provides a clearer understanding of 
the problem, it is easier for computation purposes to use (5). 
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The Effects of Time Delay and Echoes 


on Telephone Conversations 


By J. W. EMLING and D. MITCHELL 
(Manuscript received August 9, 1963) 


A brief history of the problem of echoes and delay in telephone connections 
is first given. Actual delays involved in typical circuits are shown. This is 
followed by a discussion of the effects of delay only on typical conversations. 
The sources of echo in typical telephone connections are then discussed, to- 
gether with measures which have been taken to reduce echo by improving 
return loss. Methods of controlling the effects of delayed echo are then sum- 
marized; the summary includes a brief introduction to echo suppressors and 
some of the new problems they introduce. 


I. INTRODUCTION 


About 30 to 40 years ago, the effects of time delay and echo became 
of great concern to engineers planning transcontinental telephone sys- 
tems. As a result, much effort was devoted to understanding these ef- 
fects and finding ways to control them.! Fortunately, the development 
of high-speed transmission systems made the problem less severe than 
originally anticipated, and satisfactory ways were found to handle the 
amounts of delay that were then involved. As a result, interest in the ef- 
fects of delay declined and little work has been carried out in this field 
during the last twenty years. But, as happens so often, the historical 
cycle is beginning to repeat. With the growth of intercontinental tele- 
phony and particularly the proposal of satellite systems involving one- 
way path lengths of 50,000 miles or more, the problems of delay have 
again come to the fore. 

Because of this renewed interest, Bell Laboratories has resumed its 
studies of echo and delay. The basic problems, first considered 30 or more 
years ago, have been reviewed to see if improved solutions might result 
from the technology that has evolved in the intervening period. This 
work has been addressed not only to technical problems but also to 
new testing techniques for the evaluation of echo, delay and the control 
of these factors. Throughout the recent program, emphasis has been 
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placed on the longer delays which are the unavoidable result of expand- 
ing the telephone network to provide a truly world-wide service. 

This paper summarizes the earlier information on the effects of delay 
and echo, and outlines the fundamental problems they introduce. It 
thus serves as a guide to the literature of the past and also as an intro- 
duction to two companion papers dealing with the more recent studies 
in this area. The paper by Brady and Helder’? traces the evolution of the 
echo suppressor, outlines the basic design problems involved and covers 
recent work aimed at applying modern technology to its design. The 
paper by Riesz and Klemmer* covers recent work on the subjective 
evaluation of delay and echo suppression in telephone communication. 


II, HISTORICAL BACKGROUND 


The problem of delay became acute during the 1920’s and 1930’s be- 
cause voice-frequency circuits 500 miles and more in length were being 
set up for the first time in loaded cable. The propagation speed of these 
facilities was much slower than that of the open wire used previously, 
and the delay became sufficient to cause objectionable echoes. This in- 
itiated activity which led to the invention of the echo suppressor and also 
to intensive efforts to improve hybrid balance. Echo suppressors are capa- 
ble of minimizing the most serious effects of echoes, but it was soon found 
that they also introduced other difficulties by interfering with the free 
two-way flow of speech. These impairments become more severe as the 
echo suppressor has to cope with longer delays. It was also found that 
delay alone introduces difficulties when it becomes large enough. 

These facts naturally stimulated the development and use of the 
higher-speed carrier systems. Since the 1930’s the use of these systems 
has grown rapidly, and at present practically all circuits of over 25 miles 
transmit at speeds of at least two-thirds that of light. 

Table I shows the delays encountered on typical circuits of 30 years 
ago, on present-day circuits and on those which may be in service in the 
future through the use of satellites. This table clearly brings out the 
reasons for the intense interest in delay in the earlier period, the reduc- 
tion of interest as high speed circuits were introduced, and the renewed 
current interest. 


III. FUNDAMENTAL PROBLEMS 


Problems due to echo and delay fall in three general categories. First 
are those due to the delay alone; second, those due to echo; and third, 
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TABLE I— Amount oF DELAY IN Lona TELEPHONE CIRCUITS 
























































Approx. Signal | Approx. One- 
ois A .~L I a satae PI Og pp - 
Circuit TN Gai hs t Facility bavieed et ai 
(a) Actual Circuits in the 1930’s 
New York to 900 VF on loaded cable 20,000 13 
Chicago (H-44) 
New York to 1,850 93 
Dallas 
(b) Considered in the 1930’s but Not Established 
New York to 3, 200 160 
San Francisco 
(c) Present-Day Circuits 
New York to 260 VF on loaded cable 20,000 13 
Hagerstown (H-44) 
New York to 900 K carrier 110,000 8 
Chicago 
New York to 3, 200 TD-2 186,000 19 
San Francisco 
New York to 4,100 submarine cable and 120,000 35 
London others for cable 
Hawaii to Lon- 10,000 submarine cable + 80 
don 3000 mi of TD-2 
(d) Circuits Involving Satellites 
New York to | 13,000f to | 6,000-mile satellite 186,000 70 to 97 
London 18,000 + TD-2 
49,000 24-hour satellite + 186 ,000 265 
TD-2 
Hawaii to Lon- | 29,000} to | two 6,000-mile sat- 186,000 160 to 210 
don 38 ,000 ellites + 3,000 
miles of TD-2 
98 , 000 two 24-hour satel- 186,000 530 








lites + 3,000 miles 
of TD-2 





* Includes an allowance for terminals 
{+ Length of path varies with satellite position. 











those introduced by echo suppressors. This paper concentrates largely 
on the first two categories, leaving the last to the companion papers. 
When two persons who are separated by a large distance wish to con- 
verse, there are two fundamental factors which must be taken into ac- 
count. In their simplest form, these are illustrated in Fig. 1. This shows 
customer A at the left and customer B at the right connected by a four- 
wire circuit all the way from microphone to receiver (each line in the 
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Fig. 1 — Four-wire toll circuit — no echo. 


diagram actually represents two wires). There will be losses, indicated as 
I, and LZ. , which must be compensated by gains G, and G2 or the two 
will not be able to talk at all. In addition, there will be time delays be- 
tween them, indicated by D; and De , for which no compensation is pos- 
sible. In a practical case, the losses, gains, and delays are roughly equal 
in the two directions; they are not lumped as indicated in the figure, but 
are distributed over the entire circuit. For most long circuits, the net 
loss (L — G) is made positive but close to zero. 

Unlike losses that can be reduced as desired by amplification, delay 
cannot be reduced below an absolute minimum set by nature. This mini- 
mum equals the distance divided by the speed of light (186,000 mi/sec). 
As indicated previously, many of the older transmission facilities had 
speeds well below the speed of light, but current facilities achieve speeds 
closely approaching the theoretical maximum, and hence are minimum- 
delay facilities. 

Delay changes the conversational process in a fundamental manner. 
We shall have more to say about this later, but for the moment it is only 
necessary to point out that when one talker stops talking he cannot hear 
the other reply until a time D,; + De, or about 2D, , later. 

For very long circuits, requiring the use of much gain, the separation 
of the two directions of transmission is economical, and it permits the 
use of high gains with a minimum of complication to avoid singing. On 
shorter circuits, however, it is far more economical to use “‘two-wire”’ 
circuits — that is, the same path for the two directions of transmission. 
Since by far the largest number of telephone circuits are short in length 
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(i.e., customers’ loops and local trunks), the aggregate economic benefit 
of using two-wire transmission on these circuits is very large. In addition 
to the greater economy of the facilities themselves, it is obviously more 
economical from the standpoint of switching (or interconnecting) cir- 
cuits to use two wires instead of four. 

Thus, the usual long distance call today involves both two- and four- 
wire circuits, much as shown in Fig. 2. The long four-wire toll circuits 
are converted to two-wire for transmission over short trunks between 
long distance and local offices, and over the customer’s loop. At the end 
of the loop, the telephone reconverts to four-wire for connection to the 
customer’s transmitter and receiver. 

The conversion from a four-wire path to a two-wire path is accom- 
plished by a hybrid coil. The effectiveness of the hybrid coil depends on 
the degree of match (or balance) between the impedances of the line and 
the network N. This balance is never perfect and therefore results in 
some of the incoming energy being returned as echo //; or /2 . 

It has been found that echo becomes increasingly objectionable as the 
delay is increased. This is because close-in echoes tend to be masked by 
sidetone speech, but the masking effect decreases rapidly after speech 
ceases. cho can be made less objectionable by increasing net loss, but 
loss obviously must not be increased beyond the point where talking is 
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Fig. 2— Four-wire toll circuit operated two-wire has echoes F, and £2. 
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relatively comfortable. It was this situation which led to the invention of 
the echo suppressor. The fundamental point to be observed, however, 
is that increasing the delay results in two serious effects. The first is the 
increase in the response time and the second is increase in the severity 
of echo effects. 


IV. EFFECTS OF DELAY ALONE 


Delays of tenths of seconds are in the region of typical human reaction 
times and can be expected to have important effects on the structure of 
conversation even with four-wire circuits which are completely free from 
echo. 

Fig. 3(a) illustrates a short portion of a typical conversation. The to- 
and-fro speech is shown for a case where there is zero delay in the tele- 
phone circuit. Line A shows the ‘‘talk spurts” TS from talker A at one 
end of the connection. A ‘“‘talk spurt” is broadly defined as a portion of 
speech coming entirely from one talker. 

In line B, the responses of talker B are shown. As is indicated, the talk 
spurts of each party vary considerably in length; and also the response 
times may vary. The ‘response time”? RT, as shown, is defined as the 
interval from the time one party hears the other stop speaking until he 
himself starts speaking. RT may be negative if one party starts before 
the other stops. 

__. TALKSPURTS ~__ 
J en (AVG. 4.1 SECONDS) on x 
/ De 
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‘ h nA eo 
‘ i, \ \ oY \ | | 
\A\ * fog \ 
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Fig. 3— Typical time intervals in a two-way telephone conversation: (a) 
no circuit delay, (b) same conversation observed with added round-trip delay 
of 0.6 second. 
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Talk spurts, as defined, are not necessarily continuous. This is illus- 
trated toward the right of line B by a break between B; and B3’ which 
provides an interval defined as ‘‘resumption time.’’* 

In Table II some data are given showing typical TS and RT times as 
determined some years ago in observations made by A. C. Norwine and 
O. J. Murphy on actual telephone conversations.* These data were taken 
on slightly over 2800 telephone calls over a New York-to-Chicago pri- 
vate line used exclusively for Bell System business. 

It is of interest that there are a rather large number of very short talk 
spurts. Thus the most frequent value (mode) is only 0.25 second. This 
probably represents the usual frequent monosyllabic replies. On the 
other hand, the median TS is two seconds and the average is a little over 
four. There are also occasional quite long talk spurts. 


TasBLE II—Ta xk Spurt (TS) anp Response Tre (RT) 
INTERVALS (IN SECONDS) 





Min. Mode Median Mean Max. 
TS 0.09 0.25 2.0 4.14 143.8 
RT —3.95 0.24 0.32 0.41 5.04 


For response times, the differences between mode, median and mean 
are less but there are a few long pauses and some negative times which 
represent double talking. 

The time diagram in Fig. 3(b) illustrates one way in which large round- 
trip delays can affect conversation. This is a somewhat oversimplified 
model in that it assumes that the delay does not affect conversational 
structure. The added response time of 0.6 second is actually almost 
three times the mode of the normal RT and about twice the median value. 
Thus, if people did not change their conversational habits, this increase 
in response time would substantially add to the amount of time required 
to carry on a given amount of two-way conversation. The percentage 
increase in total time depends also on the length of typical talk spurts. 
Using the figures given in the preceding table, it can be estimated that the 
average conversation might be lengthened by from 6 to 12 per cent if the 
talkers continued to talk in just the same manner as before. 

Experience with long delays, which is covered in the Riesz-Klemmer 
paper,’ however, indicates that the effect of delay is considerably more 
complex than this simple model. Actually, calls tend to be shorter in- 
stead of longer when long delay is present. Apparently there are subtle 


* Some workers in this field define B; and B;’ as separate talk spurts. 
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factors of annoyance and discomfort which cause the talkers to change 
their conversational structure and which make them wish to terminate 
the conversation sooner. 

The simplified illustration, however, shows that even if talkers learned 
to adapt themselves to delay and discipline themselves to wait patiently 
for replies, there still would be degradation. Thus it seems inescapable 
_ that the addition of delays of one-half second or more will substantially 
reduce the true value per unit time of a toll connection. The very essence 
of most telephone calls is the ability to conduct a rapid exchange of 
ideas. This may not apply to some other types of communication, how- 
ever, which involve relatively large amounts of one-way information. 

The illustration is perhaps even more useful in pointing out the possi- 
bility that delay modifies the conversational pattern. As noted previ- 
ously, even with zero delay some response times are negative. That is, 
there is a tendency to start talking before the other user has completed 
his talk spurt. With zero delay, this interruption is noted promptly, and 
the period of ‘double talking” is small. With long delays, however, the 
person who breaks in may talk for some time before the other is aware 
of the attempted interruption. In fact, it may be associated with a later, 
and quite different, part of the talk spurt than that which occasioned the 
break-in. If a response is long delayed, the original talker may resume 
his conversation before it is received, in which case the response to one 
piece of conversational material may appear as an interruption to a later, 
and possibly different, idea. 

Thus, delay not only increases the tendency to double talk, but also 
increases the potential for confusion. As will be shown elsewhere,* these 
characteristics greatly complicate the suppression problem. 


V. ECHOES AND THEIR EFFECTS 


5.1 Sources of Echo 


Any impedance mismatch in a transmission system will reflect energy 
back toward the source and be a potential cause of echo. If there is a 
mismatch at each end of a transmission line, the energy will be repeatedly 
reflected back and forth until dissipated by the line. In a strictly four- 
wire system, these reflections are ordinarily of no moment to the tele- 
phone user, since the amplifiers are one-way transmission devices and 
confine the echoes to small sections of the system with short transmission 
time and sufficient loss to rapidly damp out the reflections. 

Echoes are likely to become important, however, as soon as two-wire 
transmission is employed, since the echo can now be returned with not 
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much more attenuation than is encountered by the direct wave. Even 
though the major part of the system is four-wire, the effect of the two- 
wire portion needs to be considered since it usually controls the magni- 
tude of the echoes. 

As noted earlier, a commercial telephone connection always involves 
some two-wire transmission: a typical situation is illustrated in Fig. 2. 
Within the telephone station, the user’s inherent four-wire transmission 
is converted to two-wire to fit into the exchange plant. At a toll office, 
it is reconverted to four-wire for transmission over long distances, and 
corresponding conversions are made at the far end. In practice, there may 
be other conversions, since four-wire transmission is sometimes used in 
the local plant, but is converted to two-wire for switching. Two-wire 
transmission is seldom employed in the toll plant, but some conversion 
from four-wire to two-wire for switching does occur. We need not con- 
sider these additional conversions for an understanding of the echo prob- 
lem, but it should be appreciated that they complicate its solution. 

Four-wire to two-wire conversions are made by means of a hybrid 
coil circuit. This is a form of bridge in which a network is provided to 
balance the line as shown in Fig. 4. If the impedance of the balancing 
network N is equal at all frequencies to the impedance of the two-wire 
line, the energy from the incoming four-wire branch is equally divided 
between the line and the network and the conversion is accomplished 
without echo. The ‘balance’ in the hybrid circuit is then considered 
perfect. If there is any mismatch between the line and network imped- 
ances, the balance will be less than perfect and some of the energy will 
be transferred to the outgoing branch of the four-wire system. 

Referring to Fig. 2, it will be noted that speech from customer A can 
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Fig. 4 — Occurrence of echo in two-wire local plant. 
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be transferred from one side of the four-wire circuit to the other at either 
his own telephone set, in which case it is called ‘‘sidetone,”’ or at the dis- 
tant, or B toll office. In this latter case, it is called ‘‘talker echo.” 

The principal difference between sidetone and talker echo is one of 
timing. Within the telephone station, transmission is essentially in- 
stantaneous and the sidetone appears concurrently with the talker’s 
speech. The time for transmitting sounds to a distant toll office may, 
however, be many milliseconds. The talker echo may consequently be 
returned with a noticeable delay, in which case he will hear his own words 
reflected back much as if he were talking toward a cliff or other source of 
acoustical reflection. 

Some of the talker echo can also be reflected at office A and back again 
toward the listener at the B office. This is referred to as “listener echo,” 
since it is the distant listener who hears this some time following the 
arrival of direct speech. This process continues until the losses in the 
circulating path reduce the echo below audibility. This is illustrated in 
Fig. 5, reproduced from a paper® by G. M. Phillips. 

It is customary to simplify discussions of echoes, as we have just done, 
by considering the transfer of energy as if it occurs at the hybrid coil in 
Tig. 4 (.e., at the two to four-wire conversion point). In dealing with 
steady-state conditions, this is permissible. The magnitude of the steady- 
state echo is determined only by the relative impedances of the network 
and the line at the frequency of interest, and it can be computed if the 
steady-state impedance of the line is known at all frequencies. In prob- 
lems involving transients, however, it is important to realize that the ac- 
tual reflection occurs at any point or points in the two-wire line between 
the office and the station at which an impedance mismatch occurs. The 
time at which an echo occurs is therefore determined not only by the 
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Fig. 5 — Telephone echo paths in a typical four-wire circuit. 
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transit time to the distant office but also by the transit time between 
the hybrid coil and the point of reflection. 

Reflections do, in fact, occur at many places in a two-wire transmis- 
sion line. Unless transformers have very large mutual inductance, they 
will usually cause some reflection at low frequencies. In voice-frequency 
circuits, variations in cable mutual capacitance and irregularities in the 
spacing of loading coils will introduce many small reflections. Thus in 
practice the echo tends to be a “‘smear”’ of many reflections. 

However, by far the largest irregularity and associated reflection usu- 
ally occur at the telephone station. Voice-frequency telephone lines will 
ordinarily havea characteristic impedance with a negative angle varying 
from a few degrees in the case of loaded lines to as much as 45 degrees 
with nonloaded lines. The magnitude of the impedance may be reasona- 
bly constant with frequency, for loaded lines, or decrease rapidly with 
increasing frequency for nonloaded. The telephone station impedance, 
on the other hand, is largely affected by the nature of the receiver. The 
angle is positive and the magnitude increases with frequency. It is evi- 
dent that this is a very poor match with the line and a large source of 
echo. 

Another important reflection may occur at the local central office. If 
the circuit (loop) from this office to the customer consists of loaded cable, 
it will provide a reasonably good match to the facility connecting the 
local office to the toll office. Where it is nonloaded cable, however, the 
match is poor because of the low magnitude and large negative angle of 
this facility. 

The station and loop mismatch are problems of long standing, and 
there has been continuing effort devoted to their reduction. The problem 
is not so much technical as it is economic. The station and the loop con- 
necting it to the central office are the most numerous elements in the 
plant (the Bell System has over 65,000,000 stations), and methods for 
solving the echo problem by improving their impedance characteristics 
would involve very large aggregate costs because of the large multiply- 
ing factor. Instead, it has proved more economical to reduce echo by 
measures applied to trunk circuits, since they are far less numerous. 

The impedance of the two-wire local plant is a highly variable quan- 
tity. For any one loop, it varies greatly with frequency and also varies 
with the type of set and the length and type of facility. Thus, it is im- 
practical to balance these impedances perfectly, and it is customary to 
select a network which provides the best compromise balance over the 
range of conditions commonly encountered. This network is 600 ohms 
In series with a capacitance of 2 mf in most toll offices, but 900 ohms 
with the same capacitance is used for local offices and some toll offices. 
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The most convenient way to express the amount of echo quantita- 
tively is by using the concept of return Joss, that is, the difference in db 
between the energy delivered to the circuit at a given point and the 
energy returned at this point as echo.* Obviously, the return loss of the 
local plant relative to the compromise network is a very complex matter, 
since it involves both frequency and facility. From the standpoint of 
echo, the frequencies between 500 and 2500 cps are of most importance, 
and it is customary to use a single value ‘“‘average’”’ over this band as a 
description. The effects of the various types and lengths of facility are 
expressed in terms of a statistical distribution of this ‘‘average” return 
loss or more particularly in terms of the mean and standard deviation 
of this distribution. 

Typical values for return loss on telephone connections with com- 
promise networks are a mean of 15 db with a standard deviation of 3 db 
at the toll office. At the user’s local office, the corresponding figures are 
11 and 8 db. The higher mean at the toll office is largely accounted for 
by the attenuation of the toll connecting trunks between the local and 
toll offices. 

Return losses in the toll plant between one toll circuit and another 
are usually considerably higher than these values, and as noted earlier, 
return losses at the occasional points where toll circuits are switched on a 
two-wire basis are usually not controlling in the connection. 

A long and continuing effort has been aimed at improving local return 
losses. So far, it has not proved economical to greatly reduce the unbal- 
ance at the telephone loop and station; however, impedance equalizers 
are used extensively on the toll office end of the exchange trunks giving 
access to the toll plant, and similar arrangements for other parts of the 
local plant are being developed. It appears that these and similar 
measures will achieve small but significant further improvements in 
local return loss over the next few years. 


5.2 Effects of Echo 


Not all echoes are harmful. For example, the telephone user likes some 
sidetone (which is talker echo without delay) because it gives him the 
impression that the circuit is ‘“alive.’”? Recent tests at Bell Labora- 
tories’ have indicated that the preferred sidetone volume is comparable 
to the volume he would like to receive from the distant talker. One 
reason for preferring this much sidetone is the high level at which the 
talker hears his own speech (largely via paths within the head) when he 


* Appendix A discusses the return loss concept at somewhat greater length. 
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covers one ear with a telephone receiver. Obviously, his telephone side- 
tone must be noticeable compared to the head sidetone to obtain the 
desired impression of a live circuit. 

When a telephone user is not talking, he does not have the masking 
_ due to his own voice, and he notices much weaker echoes. Delay which 
displaces talker echoes so that they can fall in silent intervals between 
speech sounds increases the chance that the talker will hear them. When 
the delays are short, the effects are small, since they are not much dis- 
placed from sidetone, and are manifested as sidetone having.a hollow 
sound. With longer delays, the echoes clearly stand out, and, if the round 
trip delay is about one-quarter second and the returned echo is suf- 
ficiently loud (roughly 10 db above sidetone), it becomes so distracting 
that talking is virtually impossible. Even at very low levels, such long- 
delayed echoes are very annoying. 

The annoyance of talker echo was recognized in the early days of long 
distance telephony, and tests to determine the tolerable echo were re- 
ported by Clark and Mathes® in 1925. In 1953, this work was repeated 
with current telephone circuits® with the results shown on Fig. 6. This 
shows the relation between round-trip delay and the minimum loss re- 
quired to attenuate the echo sufficiently to provide a commercially 
tolerable condition in the judgment of an average listener. ‘““Commer- 
cially tolerable”? means that although an echo was discernible it was not 
loud enough to be objectionable. The curve represents the average toler- 
ance to echo of all the listeners, but any individual may differ from the 
average by a considerable amount. About 68 per cent of customers have 
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Fig. 6 — Average listener judgment of minimum echo attenuation necessary 
for just-tolerable talker echo conditions on modern telephone sets. 
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a tolerance to echo lying within +5 db of this average curve. The remain- 
ing 32 per cent would be about evenly divided above and below but 
differing by more than 5 db from the average. 

These tests were made with a typical loop connecting the talker to his 
local central office. The round-trip losses and delays shown are measured 
from the talker’s central office to the point of reflection and return to the 
same central office. In a practical case, the loss in the echo path is made 
up of twice the net loss plus the far end return loss. 

It might be thought that listener echo would be more troublesome than 
talker echo, since the user is then always in the “listening’’ condition, 
and hence does not have the masking effect of his own sidetone to reduce 
the effect of echo. In actuality, it has been found that where the round- 
trip delay is around one-quarter second or more, a given amount of 
talker echo is just about as annoying as the same amount of listener 
echo. Hence, talker echo is controlling on typical telephone circuits be- 
cause the listener echo is always additionally attenuated by a second 
(near end) return loss. 


5.3 Controlling the I’ffects of Echo 


There are two fundamental ways in which echo can be controlled: 
(a) by reducing the delay and (b) by increasing the loss in the echo path, 
ie., controlling echo magnitude. The use of carrier systems with their 
high propagation speeds has reduced the delay on a large part of the 
land line system to the point where echo is negligible. Since the speed 
is now close to that of light, little further reduction of delay is possible, 
and control of echo magnitude must be adopted where length is great 
enough to cause significant delay. 

Control of echo magnitude has, over the years, taken many forms. 
Four-wire circuits and four-wire switching are used very largely in the 
long distance plant and have largely eliminated echoes except from the 
end links.* As noted earlier, it has so far not proved economical to 
reduce greatly the reflection at the telephone station. 

Another way to reduce the effects of echo is to increase the loss be- 
tween the talker and the point of major echo. It is to be noted that each 
db added will decrease the echo by 2 db. Attractive as this may be from 
an echo standpoint, it is obvious that it must not be pushed to the point 
where the direct transmission path is adversely affected. 

* Even though the toll transmission paths are almost entirely four-wire, two- 
wire switching is still used extensively at smaller toll offices. By careful adjust- 


ment of the balancing networks and office wiring, the reflections from the four- 
to two-wire conversions are kept minor compared to the end link reflections. 
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The approaches just discussed are all simple, direct attacks on the 
problem. There is also a slightly more sophisticated approach, the echo 
suppressor, which is discussed in more detail in a companion paper. 
This is a voice-operated switching device which reduces echo by intro- 
ducing loss in the return transmission path of the four-wire system. Other 
sophisticated devices are conceivable. For example, since the return loss 
problem arises from the inability to obtain good return loss from a 
single balancing network facing a large variety of line impedances, it 
might be possible to make available a series of balancing networks from 
which the one giving the best balance for the particular circuit in use 
would be selected. This solution has been looked into quite extensively 
but, so far, the relatively small benefits which might result have not ap- 
peared to warrant the complications of implementation. 

Another proposal frequently advanced has been the use of a self- 
balancing hybrid. Most of these schemes involve the measurement of 
current and voltage associated with the line impedance and the intro- 
duction of compensating currents or voltages in the balancing arm. F. B. 
Llewellyn has shown that this result is theoretically unattainable with 
any linear networks, 1.e., with any arrangement for which the inserted 
compensating voltage or current is instantaneously, linearly related to 
the line value. This formerly unpublished document is reproduced in 
Appendix B with the author’s permission. 

In effect, this proof applies to a single fixed network which might of 
course be a complex device. However, it should be noted that this does 
not outlaw some more sophisticated approaches involving nonlinear 
networks or networks which are adjusted in accordance with some 
combination of the incoming signal and its echo. The latter class of de- 
vice is sometimes called a ‘linear adaptive system.”’ Although such ar- 
rangements may be technically achievable, they too have up to now 
appeared to be too complex and expensive to justify their use. 


5.4 Echo Control in the Bell System 


The use of four-wire circuits, four-wire switching and careful office 
balancing in two-wire offices has essentially eliminated echoes except 
from the end links. Also, some return loss improvement has been achieved 
through impedance correcting networks. Thus, for circuits with round- 
trip delays under about 45 milliseconds, it is possible to assure acceptable 
echo conditions on about 99 per cent of the calls by engineering the long 
distance circuits to have enough loss.® 

When round-trip delay gets beyond about 45 milliseconds, however, 
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the losses required would be unacceptable. Such circuits are operated at 
a low loss with echo suppressors. In general, these devices are used on 
most circuits over 1000 to 1500 miles in length. Under these circum- 
stances, acceptable echo conditions are obtained on most calls, and the 
received volume of direct speech approaches the preferred value. 

In principle, the echo suppressor is very simple and can be illustrated 
by the central echo suppressor invented by Clark and Mathes® in the 
early twenties. This device, shown schematically in Fig. 7, consists 
basically of two voice-operated switches. Speech from A traversing the 
upper four-wire path operates switch X_2, which disables the lower re- 
turn path for the time necessary to prevent the return of echoes gener- 
ated at the B end. Similarly, speech from B operates the switch X, and 
blocks echoes generated at the A end. 

It is apparent that such a device may well interfere with normal con- 
versational patterns. When the device is fully effective against echoes, 
it modifies the transmission system so that it is no longer a full duplex 
link capable of carrying intelligence in both directions simultaneously. 
Instead, it approaches a half duplex system which can be used in either 
one direction or the other but not in both at once. This will interfere to 
some extent with the free two-way flow of conversation, especially in 
those cases noted in Section IV where talk spurts overlap, most often 
caused by one talker trying to interrupt the other. 

The echo suppressors now commonly in use are considerably more 
sophisticated than indicated by the simple model described above. 
Since these devices are discussed in detail in a companion paper, it suf- 
fices here to point out that they have been used with considerable success 
over a period of 30 or more years with the round-trip delays of 50-100 
milliseconds currently encountered. We should also add that an echo 





Fig. 7 — Central echo suppressor; H is slow release time of switch. 
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suppressor is essentially a compromise device. There is theoretically no 
great difficulty in suppressing echoes. The problem arises in accomplish- 
ing this while working with the wide variation in circuit characteristics, 
noise, and speech volume of the commercial telephone system. At the 
same time, there should be little interference with the rapid interchange 
of ideas which gives telephony so much value. Many of the requirements 
tend to be conflicting, and good design involves weighing the relative 
effects of a considerable number of subtle psychological factors. 

Additional problems arise when several echo suppressors are used in 
tandem. Obviously the probability of interfering with normal conversa- 
tional patterns increases with the number of suppressors used on a 
circuit. A particularly serious interference can occur where two or more 
complete suppressors are used with delay between them. In such a case 
it is possible to completely block both directions of transmission. This 
phenomenon is called ‘lock-out.’ 

Lock-out can arise with the situation illustrated in Fig. 8. Let us as- 
sume that A has momentarily stopped talking and the transmission path 
is functioning in each direction. If B starts talking he will first operate 
switch 3 and at time D later he would also operate switch 1. However, 
if talker A resumes talking before time D his path is open to his sup- 
pressor and he will operate switch 2. Now both directions of transmis- 
sion will be blocked and remain so until one party stops talking. Since 
neither one knows that the other is talking the “lock-out”? may continue 
for a long period. 

It should be noted that the important factor in producing lock-out is 
the delay D between Xz and X; . This delay provides a time storage which 
allows each talker to independently capture control of the suppressor 
nearest to him. 





Fig. 8 — Multilink switched connection with two complete echo suppressors — 
lock-out can occur. 
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Lock-out was studied at length by A. W. Horton in the 1930’s.° He 
showed how the probability of its occurrence can be calculated from a 
knowledge of the circuit and speech parameters. 

Some protection against lockout can in theory be provided through 
echo suppressor design, but in the long run it may be more desirable to 
avoid the use of tandem suppressors by suitable switching arrangements. 


VI. CONCLUSION 


In resuming the work on echo and delay, we found ourselves with a 
solid background of fundamental knowledge but were faced with a num- 
ber of new problems. Perhaps first in importance was the determination 
and evaluation of the effect of using delays far longer than those experi- 
enced hitherto. Second, and closely related, was the question of designing 
suppressors to control these larger delays, and particularly the applica- 
tion of modern techniques in the expectation that they would provide 
more effective devices than those designed for the shorter delays. These 
are the main subjects of the papers which follow. 


APPENDIX A 


Further Notes on Return Loss 


While return loss, in principle, can be measured at any point in a 
circuit, it is most conveniently specified and measured at the four-wire, 
two-wire junction. At this point, both the incident and the reflected 
wave can be determined from measurements on the transmitting side of 
the hybrid using a signal source on the receiving side. The relative 
strength of the wave delivered to the two-wire line is determined by 
opening the network: that is, producing a complete unbalance. The 
relative strength of the echo is then determined by connecting the net- 
work and measuring again. The difference between these two measure- 
ments, both expressed in db, is the return loss. If the network exactly 
balances the impedance of the line, there will be no reflected wave, and 
the return loss will, of course, be infinite. 

It should be noted that it is the difference between the measurements 
made with the network leg open and terminated in a network that ex- 
presses the return loss; not the total loss from receiving to the trans- 
mitting side. This latter, or transhybrid, loss exceeds the return loss by 
the losses inherent in the hybrid coil. Referring to Fig. 4, the incident 
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energy at H will be seen to be lower than at B by the coil loss B to E (ideally 
3 db and practically about 3.5 db). Similarly, the reflected energy at C 
is lower than that at # by the coil loss. Thus, the transhybrid loss B to C 
exceeds the return loss by twice the coil loss (about 7 db in practice). 
If amplifiers were introduced to just compensate for the coil losses, the 
difference in levels between A and D would be the return loss. 

Return loss may also be expressed in terms of impedances. At the 
hybrid 


Zi + Zw 


return loss = 20 log Z Z 
L N 








where Z, and Zy are the line and network impedance respectively. At 
any point in the two-wire circuit the return loss is also defined as above, 
where Z;, and Zy are interpreted as the sink and source impedances. 

Return loss is a function of frequency. From the standpoint of echo, 
the frequencies between 500 and 2500 eps are of most importance. Often 
a band of resistance noise covering this band is used as a source for 
measuring “average’’ return loss. 

It will be appreciated that the reflections which give rise to echo can 
also cause oscillation or singing if the gain around the four-wire loop 
equals the loss at any frequency. Usually the high and low frequencies 
are controlling and return loss at the most critical frequency is referred 
to as a “singing” return loss to distinguish it from the loss at the fre- 
quencies important from an echo standpoint. 

Return loss is often used to describe impedance characteristics and the 
uniformity of transmission lines. It is not practical to manufacture all 
cable pairs with precisely the same mutual capacitance; there will be 
some variation with length and from pair to pair. Similarly, it is not 
practical to introduce loading coils at precisely the theoretical spacing; 
some departures will be required to fit the geographical situation. It is 
customary to describe the effects of such variations in the impedance of 
the facility in terms of the “structural return loss.” This is the return 
loss of an infinitely long (or a suitably terminated) practical cable com- 
pared to the impedance of an ideal cable with no irregularities. Typically, 
structural return losses run about 20-25 db for exchange plant circuits. 
Toll grade cable traversing areas which place few restrictions on the 
location of loading coils may have structural return losses of about 30-35 
db. This is much higher than the 11-15 db typical of local return 
losses. , Bose. meh 
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APPENDIX B 


Proof That a Self-Balancing Hybrid Cannot Be Constructed with Linear 
Networks* 


By F. B. Llewellyn 


In hybrid coils, the degree of balance that may be realized between 
the balancing network and the attached load impedance is one of the 
fundamental limitations in the design of telephone systems. From time to 
time, the proposal has been made that some means of producing an 
automatic self-balancing coil should be sought. Various schemes for 
accomplishing this have been suggested. Essentially most of them, in one 
way or another, involve the idea that it should be possible to insert 
vacuum tubes in such a way that they measure the current and voltage 
associated with the line impedance and introduce compensating values 
into the arm containing the balancing network. In this way, it was hoped 
to maintain the hybrid balance regardless of changes in the line im- 
pedance. 

The following offers a proof that this result is theoretically unattain- 
able with any linear networkf whatever, regardless of whether it con- 
tains amplifiers, negative impedances, gyrators or any other linear ele- 
ments, active or passive, that have been conceived in the past or may be 
thought of in the future. Of course, this does not rule out the possibility 
of obtaining operating improvements by certain nonlinear devices, and 
voice-operated echo suppressors provide a good example of such a 
nonlinear device. 

As a starting point in the proof, Fig. B1 on the accompanying sketch 
shows the generalization of the configuration to be considered. The box 
contains the network elements of the hybrid itself while the four pairs of 
terminals represent, respectively, the attached line Z,, the balancing 
network Zy, the transmitting impedance Z7, and the receiving im- 
pedance Zz. Signal generators may exist in all of these except the bal- 
ancing network Zy. Consequently, the figure may be generalized even 
further, as shown in Fig. B2, by allowing Zy to be included as part of 
the network within the box. 


* This proof was prepared as a Bell Telephone Laboratories internal document 
dated October 18, 1950 and is published here with the author’s permission. 

t Note added by author in 1963: ‘“‘With the exception of the line impedance 
Zr, all of the other impedances in the network are taken to be independent of time. 
Though implied in paragraph 1, this point has caused some confusion.”’ 
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Fig. Bl — General prototype of hybrid terminations. 


The conditions for an operative hybrid balance are three in number 
and may be set down as follows: 

1. When the system is driven from Vz (that is, when Vz = Vr = 0), 
the current [7 is zero. 

2. When the system is driven from Vz (that is, when V, = Vz = 0), 
the current J; is not zero. 

3. When the system is driven from V, (that is, when Vz 
the current Ir is not zero. 

The requirement for self-balance is that these three balance conditions 


Vr ae 0), 


Il 


Zn !S WITHIN 


THE BOX 





Vr 
Fig. B2 — Equivalent of Fig. B1 for present analysis. 


2890 THE BELL SYSTEM TECHNICAL JOURNAL, NOVEMBER 1963 


shall remain in force regardless of changes in the value of the line im- 
pedance Z,. 

Let us see what these conditions mean when applied to the general 
linear equations which describe the external behavior of the configura- 
tion in Fig. B2. These may be set down as follows: 


re I, | 
Z1 + Zr Z12 Z13 Vr 
LZ Zo + Zr L23 Vi 
Z31 239 Z33 + Zr Vr (1) 


where the Z’s with double subscripts are general impedance constants 
representative of the self-impedances and transfer impedances of the 
network within the box. Thus they are independent of the terminating 
impedances Zr, Z,, and Zr. 

When the system is driven from Vz (that is, when Vz, = Vr = 0), 
the current J; becomes, from (1) 


V 
Trp 2 x [ZoZso - Z31( Zoe a= Z1)| (2) 


where A is the determinant of the matrix in (1). According to the first 
of the three balance conditions, this current must be zero, which requires: 


ZuZ33 — Zu(Zoe + Zr) = 0. (3) 


If self-balance is to be imposed then (3) must remain satisfied regard- 
less of changes in Z, . This can be accomplished only if both of the fol- 
lowing are true: 


Z31 = 0 (4) 
2332 = 0. (5) 


To repeat this in words, we have found that the condition for self- 
balance requires that Z3; should be zero, and, moreover, that either Z2 
or Z32 should be zero. 

Let us see what these requirements mean in terms of the second and 
third of the three balance conditions. To do this, we need the equation 
for J, when Vz, = Vr = O and the equation for J7 when Ve = Vr = 0. 
From (1) these are, respectively: 


Tp: 


I 


a [ZsiZos — Za (Z33 ss Zr)] (6) 


i “ [ZaZi2 — Zon(Zn + Zn). (7) 
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In the event that (4) is satisfied, these become 


os — 8 lan Zss + Zn)] (8) 


Ip, = 18 [Zs2(Zu zi Zr) |. (9) 


Then when (5) is satisfied in addition, we see that I,, 1s zero in the event 
that Zo, = 0 and Jz, is zero in the event that Z3. = 0. 


It therefore must be concluded that the self-balance condition cannot 


be satisfied without violating either the second or the third of the three 
general hybrid balance conditions. Consequently, the proof is complete 
that a self-balancing hybrid cannot be constructed with linear networks. 


REFERENCES 


13 
2. 
3. 


Blackwell, O. B., Time Factor in Telephone Transmission, Trans. A.I.E.E., 
51, 1932, pp. 141- 147; reprinted as Bell System Monograph 619. 

Brady, P. bs and Helder, G. K., Echo Suppressor Design in Telephone Com- 
munications, BS.T.J., this i issue, p. 2893. 

Riesz, R. R., and Klemmer, E. T. , Subjective Evaluation of Delay and Echo 
Suppression in Telephone Communications, BS.T.J., this issue, p. 2919. 


. Norwine, A. C., and Murphy, O. J., Characteristic Time Intervals in Tele- 


phonie Conversation, B.S8.T.J., 17, April, 1988, pp. 281-291; reprinted as 
Bell System Monograph 1074. 


. Phillips, G. M., Echo and Its Effects on the Telephone User, Bell Laboratories 


Record, 32, August, 1954, pp. 281-284. 


. Clark, A. B., and Mathes, R. C., Echo Suppressors for Long Telephone Cir- 


cuits, Bell Telephone TP bomtete: Monograph 165, presented at A.L.E.E., 
June, 1925. 


. Noll, A. M. , Subjective Effects of Sidetone During Telephone Conversation, 


to be published. 


. Huntley, H. R., Transmission Design of Intertoll Telephone Trunks, B.S.T.J., 


32, September, 1953, p. 1019. 


9. Horton, Arthur W. Jr., The Occurrence and Effect of Lockout Occasioned by 


Two Echo Suppressors, B.S.T.J., 17, April, 1938, pp. 258-280; reprinted as 
Bell System Monograph 1073. 


Echo Suppressor Design in Telephone 
Communications 


By P. T. BRADY and G. K. HELDER 
(Manuscript received August 9, 1963) 


An elementary echo suppressor is described which consists of a single 
voice-operated switch located at each end of a four-wire toll connection. It is 
shown that this circuit 1s unsuitable for telephone conversations tf there is an 
appreciable delay between the speakers. Modifications and additions to the 
elementary echo suppressor are made in an effort to improve its perform- 
ance. Action in the presence of break-in speech is discussed in some detail. 
The operation of four present-day echo suppressors ts described. 


I. INTRODUCTION 


In a companion paper by J. W. Emling and D. Mitchell,' it was shown 
that the problems due to telephone echoes increase as the propagation 
time between speakers increases. As a result of the recent interest in 
satellite transmission, efforts have been made by several groups within 
and outside of Bell Telephone Laboratories to build echo suppressors 
particularly suited to the long delays encountered in such communica- 
tions. In the first part of this paper, an elementary echo suppressor is 
described so that the reader may become familiar with some of the 
terminology used in this field and problems associated with the use of 
echo suppressors. Then, additions and modifications made to the simple 
echo suppressor to improve its performance are discussed. I*inally, some 
of the more recent suppressor designs are illustrated. The echo suppres- 
sors to be described in the later sections of this paper are the same echo 
suppressors used in the subjective tests reported in a companion paper 
by Riesz and IXlemmer.? 


Il. AN ELEMENTARY ECHO SUPPRESSOR 


2.1 Functional Description 


Assume a telephone circuit of the type shown on Fig. 1 is set up be- 
tween two speakers A and B. The round-trip delay is appreciable, and is 
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Fig. 1 — A simple echo suppressor. 
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shown for illustrative purposes as lumped and equal to 600 msec. Since 
the match of network N to the impedance Z of the 2-wire line at the 
hybrid is never perfect, echo is generated at the hybrids. If no devices 
were used to control the echoes on this circuit, it would be at best con- 
fusing and frustrating for the speakers if they attempted to converse. 
To block the echoes, a simple voice-operated relay or echo suppressor, 
as illustrated, can be installed at each end of the 4-wire portion of the 
circuit. Now, if speech from B appears at point 1 (generally called the 
‘“‘odd”’ side of the echo suppressor), the relay operates to open the line 
completely from A to B at point 2, the ‘“‘even”’ side of the echo suppressor. 
These relays are effective in eliminating virtually all echoes, but they 
introduce other difficulties. 

Assume that speaker B is talking and speaker A is silent. The trans- 
mission path from A to B will be blocked at point 2. If A now begins to 
talk at the same time B is talking (a condition called “‘double talking’), 
it is apparent that A cannot interrupt B’s speech no matter how loud he 
talks. However, A has no way of knowing that his interrupting speech 
has failed to get through to B, and he may think that B is being impolite 
in not responding to his speeeli: This type of degradation would occur 
even if there were no delay between the speakers, and by itself could be 
a serious drawback of this echo suppressor design. 

The delay coupled with the suppressor action causes an additional 
effect which is more pronounced and is very irritating. Assume B talk- 
ing, A silent. At time t) , B pauses momentarily, then at time f + 0.3 
sec, A hears this pause, and immediately interjects a short comment, 
such as “Uh huh,” or “Yes, of course.’’ Because there is no speech at 
point 1 (Fig.' 1), A’s comment gets through to B. Assume now that B 
had resumed talking at time tp + 0.2 sec. At to + 0.6 sec, A’s comment 
arrives at the echo suppressor near B and suppresses B’s continuing 
speech for the length of A’s comment. Then, at to + 0.9 sec, A detects 
a short interruption of B’s speech. 

Two disturbing events have thus occurred. First, B heard A’s comment 
at an inappropriate time, after he had resumed apenas and second, 
A heard a break in B’s speech. Although the second event is intuitively 
the more disturbing one, the first could well cause B to stop talking so 
he could hear the interrupting speech. Normal conversation would then 
come to a halt and be replaced by a frustrating attempt by each speaker 
to establish what the other person was trying to say. This and similar 
sequences could occur many times in a conversation and result not only 
in annoying the speakers but.also in causing them to waste time repeat- 
ing their remarks and commenting on the bad circuit. 
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The intermittent interruptions which the echo suppressors introduce 
are commonly referred to as “‘chopping.”” Chopping is most serious when 
the conversation alternates rapidly between speakers, and is nonexistent 
if only one person is talking. It becomes more objectionable as the delay 
between the speakers increases. 

Methods to reduce this objectionable chopping have been devised 
and will be discussed later, after some characteristics of the simple echo 
suppressor are further described. 


2.2 Characteristics of a Simple Suppressor 


2.2.1 Amount of Suppression 


It has been found that in circuits with long round-trip delays (say 200 
msec and longer) and with echoes at their present-day levels, consider- 
able attenuation must be supplied by the echo suppressors to reduce 
echoes so they are not annoying. Typically, about 50 db or more attenua- 
tion is provided. The present trend is to use some type of solid-state 
switching to introduce the loss, but a relay which short-circuits the line 
is very effective. In most recently designed echo suppressors, the attenua- 
tion introduced is so great that for all practical purposes the transmission 
path is completely blocked. Some suppressors, however, insert only a 
moderate amount of lossif the speech signal at point 1 of Fig. 1 is very 
weak, and insert larger amounts of loss for strong speech signals. They 
employ a “‘variolosser” which inserts a loss whose magnitude is a smooth 
function of the controlling signal. This would appear to be a good method 
of applying suppression since only the amount required is present at any 
time, and hence double talking should be easier. However, the vario- 
losser provides satisfactory echo suppression only when it begins to in- 
sert loss on very weak speech signals. For signals of more typical mag- 
nitudes, at least 50 db suppression is required, and hence the variolosser 
is not much different from a relay. Further discussion of the operation 
of an echo suppressor will be with reference to relay operation, although 
an echo suppressor employing variolossers will be described later. 


2.2.2 Sensitivity 


The local sensitivity of the echo suppressor can be defined as the level 
of a 1000-cps tone, applied at the odd input, just sufficient to cause the 
speech detector to operate the relay. The signal level is specified as a 
power level in dbm working into a 600-ohm load. The local sensitivity 
includes within it an adjustment for the transmission level point at 
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which the suppressor is operated. Since it is much simpler to discuss its 
operation at the zero transmission level point (OTL),* all references to 
sensitivity hereafter will apply to the zero-level sensitivity, i.e., the 
sensitivity at OTL. 

If the speech detector shown in Fig. 1 has a flat frequency response, 
it will generally provide marginally acceptable suppression of long de- 
layed echoes if its 1000-cps sensitivity is about —32 dbm. Greater 
(lower-level) 1000-cps sensitivities are required if the frequency response 
of the detector is weighted in such a way that it is less sensitive to fre- 
quencies below and above 1000 eps. 

It would seem advisable to build a suppressor which has a very high 
sensitivity to assure complete suppression of echoes of all speech sounds. 
But if the sensitivity is too great, the suppressors will operate on noise 
as well as speech, which certainly is not desirable. Recent data on noise 
on telephone circuits indicate that less than 1 per cent of the longer toll 
calls have a noise level at OTL greater than about 51 dbrne.t This 
equates to —37 dbm if the noise is flat over the voice spectrum. Thus the 
sensitivity of an echo suppressor with flat bandwidth should not be 
greater than —37 dbm if operation on noise less than 1 per cent of the 
time is desired. 

The sensitivity for adequate suppression and for minimal noise opera- 
tion depends on the shaping of the dectector sensitivity characteristics. 
Previous studies have shown that a flat bandwidth between 500 to 
3000 cps with reduced sensitivity is preferable for adequate suppression 
to a bandwidth with higher sensitivity at some of the frequencies in the 
band, if both suppressors are set for equal noise operation. The flat 
bandwidth helps to improve the sensitivity for initial consonants from 
many talkers. 


2.2.3 Pickup Time 


By definition, the pickup time is the time required for the suppressor 
to operate after the receipt at the odd input of a 1000-cps signal with a 
power level 3 db greater than the sensitivity. The pickup time should be 
short, since a long pickup time would cause the speaker to hear brief 
spurts of echo. These would sound like short ‘blips’ and could be 


* The zero transmission level point is a point to which all level points in a toll 
system can be referred. It is analogous to citing altitude by referring to height 
above sea level. The zero-level point is at the transmitting toll switchboard of the 
system under consideration. 

+ 51 dbrn as measured on a 3A noise meter with C message weighting. 

ft See Ref. 3, page 1458. 
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annoying even if they were not recognized as echo. A pickup time of 
about 5 msec is satisfactory, but this may produce operation on impulse 
noise, which again is not desirable. However, the bad effects of impulse 
noise operation can be minimized by proper control of the hangover, as 
discussed in Section 2.2.4. 


2.2.4 Hangover 


When the speech signal appearing at the odd input ceases, the sup- 
pression relay should remain operated briefly, for two reasons: 

1. The speech signal may not terminate abruptly, but rather may 
contain low-level energy, such as that provided by a fricative. This 
speech may be below the sensitivity of the suppressor, but it should 
still be suppressed. 

_ 2. The echo may appear at the even side after speech appears at the 
odd ‘side, due to the delay in the telephone circuit between the echo 
suppressor and the near telephone set (commonly called end delay). 

“ To suppress these weak speech endings or delayed echoes, the echo 
suppressor is supplied with a slow release, or hangover time, which holds 
suppression after the speech level at the odd input has fallen below the 
sensitivity. The required hangover is dependent on at least three factors: 

1. the end delay, 

2. the echo suppressor sensitivity, and 

3. the total circuit delay. 

Generally, more hangover is required for long end and circuit delays 
and for lower sensitivities. No one hangover can be said to be acceptable 
in all cases. In the Bell System, where circuit delays of 30-40 msec and 
end delays of 20 msec are typical, the Western Electric 1A echo sup- 
pressors, having a detector with a shaped frequency response and a 
1000-cps sensitivity of —31 dbm, have a suppression hangover of 50 
msec. 

If impulse noise operates the echo suppressor as described above in 
Section 2.2.3, the suppression hangover could chop appreciable portions 
of the speech if full hangover were applied for impulses. Most impulse 
noise is of very brief duration, and therefore the adverse effects of im- 
pulse noise operation can be minimized by providing a deferred or 
variable hangover which reaches full value only for speech sounds longer 
than about 50 msec. 


2.3 Full vs Splut-Terminal Echo Suppressors 


In the suppressor just described the echo suppressor is split, with 
half located on either side of the major delay. The configuration shown 
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in Fig. 2 can also be used. In this case the echo suppressors for both 
speakers are located at one terminal. The echo suppressor indicated as 
1 operates in an identical manner to the one described previously, but 
a very long hangover is required for echo suppressor 2 because the delay 
between speakers causes the echo of party A to appear at the suppression 
point long after the speech appears at the detection point. The combina- 
tion of echo suppressors 1 and 2 is known as a full echo suppressor. A 
full echo suppressor is frequently used for circuits in which the round- 
trip delay is not very great (e.g., 50 msec or less). The hangovers re- 
quired for full echo suppressors on circuits with long delay considerably 
increase the difficulty in conversing.* 


FULL ECHO SUPPRESSOR 







HYBRID 


LONG HANGOVER 
REQUIRED 


Fig. 2— A full echo suppressor installed at one terminal. 


The echo suppressors shown in [ig. 1 overcome this difficulty, and are 
called split-terminal echo suppressors because the functions of the full 
echo suppressor have been delegated to two units, located at the termi- 
nals of the intertoll trunk. Since the full echo suppressor requires only 
one installation, it has obvious economic advantages over the split- 
terminal echo suppressor, but because of its unsuitability for use in 
long-delay circuits, almost all recent design efforts have been devoted to 
developing an improved split-terminal echo suppressor. 


III. DIFFERENTIAL ACTION 


The conversational difficulties introduced by the echo suppressors of 
Fig. 1 arise from the failure to take into consideration the rapid inter- 


* Years ago a full suppressor was sometimes installed midway between the 
speakers, and the long hangover previously required for one echo suppressor was 
divided between the echo suppressors. Such use today is in most cases impractical, 
since many circuits are multiplexed from end to end and revert to voice frequency 
only at the terminals. 
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changes of speech and the occurrence of simultaneous or double talking. 
The simple design which provides for silencing the even side at all times 
that speech appears at the odd side is not sufficient to provide a good 
communication path. A modification must be found which allows the 
echo suppressor to operate in a different mode if “break-in” speech is 
present at the even side at the same time that speech appears at the odd 
side, i.e., when double talking is taking place. 

Ideally, during periods of double talking, the suppression should be 
removed so that free to and fro conversation can take place. Of course, 
when the suppression is removed, the echo is also transmitted with its 
degrading effects. The problems associated with double talking are dis- 
cussed below. 


3.1 Detecting Break-in Speech 


Recognition of break-in speech in the presence of distant party speech 
is a most difficult problem in echo suppressor design, and no completely 
successful solution of this problem has been achieved. The nature of the 
problem is illustrated in Fig. 3, which is a simplified schematic of one 
end of a long distance telephone connection. Speech received from B 
appears at point 1, and speech from A appears at point 2 along with the 
echo of B’s speech. 

A simple speech detector installed at point 1 will easily recognize 
B’s speech, and only B’s speech, since the isolation amplifier prevents 
A’s speech from reaching this point. But there is no point at which A’s 
speech appears by itself, except of course at A’s transmitter which is, 


A SPEECH PLUS. 


BiEGHO” 22 Saag 


\ To B 
a) EVEN VIA DELAY 












1 
\ 2 — 
H | 
1 J 
I t 
1 | 
I | 
! ! 
TELEPHONE ECHO 
SETA HYBRID | SUPPRESSOR | 
| | 
| | 
/ ! | 
A SPEECH PLUS ! 
B SPEECH PLUS 
| ! FROM B 
SOME OF B ECHO | | VIA DELAY 
! <_ 
L 


ISOLATION ae 
AMPLIFIER SB SPEECH 


Fig. 3 — Speech signals at various points in a two-wire to four-wire connection. 
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unfortunately, inaccessible. A’s speech is present at point 3, but B’s 
speech is also present here, traveling not only toward A’s telephone set 
but also reflected from the telephone set toward the hybrid. Further- 
more, point 3 is generally inaccessible. It may be located miles away 
from the echo suppressor and is frequently not permanently associated 
with the same trunk on which the suppressor is installed. It is apparent 
that point 2 is the only point which can conveniently be examined for 
A’s speech. 

A simple speech detector, such as that used for suppression, installed 
at point 2 to detect break-in speech and remove the suppression, will 
operate on echo as well as local speech and is thus inappropriate as a 
local speech detector. However, many properties of the echo are known, 
because the echo is a distorted reflection of the signal at point 1. All 
existing break-in speech detectors compare a signal derived from point 2 
with a signal derived from point 1. This “reference signal’’ derived from 
1 is a measure of what the echo is expected to look like. If the signal at 
2 possesses only those characteristics which the echo would contain, 
that is, if the signal derived from 2 is contained within the reference 
signal, then echo only is assumed at point 2. If, however, the signal at 2 
is sufficiently different from the reference, speaker A is assumed to be 
talking and the break-in detector is activated. A good break-in detector 
should be as sensitive as possible to the break-in speech, but it should 
not operate when echo alone is present. A break-in detector triggered by 
echo alone is said to exhibit ‘false break-in.” 

Many different break-in detectors have had as their basis for design 
the method outlined above. Two of these devices are described in more 
detail. 


3.2 The 1A Break-in Detector 


Tig. 4 is a block diagram of the suppression and break-in scheme of 
the Western Electric 14 Echo Suppressor. (This suppressor is more 
fully described in Section IV.) The speech signals from points 1 and 2 
are amplified, rectified, and applied directly to the differential device as 
illustrated. The return loss at the hybrid has an average value of 15 db 
with a standard deviation of 3 db.! Thus, the echo at 2 is certain to be 
at least 6 db below the signal at 1 if both points are at OTL. The differ- 
ential comparator is therefore adjusted so that if echo of this level or 
less is present at 2, the differential will yield a positive output. When this 
happens, relay 1 blocks the even path at point 4 with a hangover of 50 
msec and also blocks the negative output of the differential at point 3. 
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Fig. 4— Block diagram of a 1A echo suppressor used in split-terminal fashion. 


If speaker A begins to talk, the signal at 2 will occasionally override 
the differential and cause it to have an output on the negative side. If 
this occurs for 50 consecutive msec, relay 1 drops out and removes the 
block which it put in at points 3 and 4. Relay 2 now operates with an 
18-msec hangover, assuring break-in for at least this hangover period. 
Break-in will continue until the differential becomes positive for at 
least 18 msec, in which case suppression will be restored. If no one is 
speaking, the differential has a null output and neither relay will be 
operated. 

The “reference signal” referred to in Section 3.1 is in part the recti- 
fied signal derived from 1, but it also is contained in the hangover of the 
relays. I’or example, if B talks and then stops (A is silent), the echo may 
be delayed in reaching point 2. The echo could operate the differential 
for a brief period (up to about 20 msec) after speech ceases at 1, but the 
50 msec hangover of relay 1 will prevent false break-in. 


3.3 Break-In Detection by Means of Smoothed Rectified Speech 


The majority of suppressors designed in the past few years employ 
a break-in scheme which uses a differential to compare two speech signals 
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which have been full-wave rectified and smoothed, as opposed to the 1A 
differential action which employs negligible smoothing. Fig. 5 shows a 
typical full-wave rectified speech signal which could appear at the odd 
side of the echo suppressor. The echo at the even side of the suppressor 
is certain to be at least 6 db (one-half amplitude) below the speech on 
the even side. It can also be delayed up to a maximum of about 20 msec. 
This ‘“‘worst: case” of echo is shown, full-wave rectified, as the dashed 
line of Tig. 5. Because of the delay, there are many times in which the 
echo signal exceeds the odd speech signal. But if the original signal is 
smoothed with an RC network so the voltage decays to one-half value 
in 20 msec, the echo will never exceed this smoothed signal. The smoothed 
signal is the reference signal and is indicated in Tig. 5. In practice, the 
echo signal is also smoothed, but with a smaller time constant (any 
value less than the time constant of the other). If the signal from the 
even side should ever exceed the reference, the local speaker must have 
been talking, and the break-in detector is activated. The break-in detec- 
tor is completely independent of the suppressor relay operation, as 
opposed to the 1A action. 

A block diagram of the instrumentation is shown in Frig. 6. A negative 
signal of sufficient strength from the differential (a simple voltage adder) 
will trigger the threshold and cause break-in to be indicated. Once 
break-in speech has been detected, the suppression is removed and the 
echo suppressor is in the break-in mode. The manner in which the 
suppressor operates in the break-in mode varies with different suppres- 
sors, and illustrations of various break-in modes of operation will be 
given later in this paper. 
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Fig. 5 — Typical waveforms appearing in rectified speech break-in detector. 
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3.4 Sensitivity and Pickup of the Break-In Detector 


The parameters of the break-in detector will be discussed with refer- 
ence to Fig. 6, with both the odd and even sides assumed at OTL. These 
parameters are not, however, restricted to this circuit but apply to any 
break-in detector. 

The break-in sensitivity is the 1000-cps signal power in dbm applied 
at the even input which will just cause the break-in detector to operate. 
This tone is applied with no signal at the odd side. A typical value is 
—32 dbm re OTL. This sensitivity determines the ability to remove 
suppression during the suppression hangover after speech has disap- 
peared from the odd side. It is also important in recognizing local speech 
just prior to the arrival of distant speech at the odd side. The break-in 
hangover (discussed later) will then prevent the local speaker from being 
immediately cut off. 

The time required to operate the break-in detector or pickup time 
should be fairly fast, say 5 msec, but this is not so critical as suppressor 
pickup time. As shown before, long pickup times on the suppressor » 
cause some echo to be returned. Long pickup times on the break-in 
detector cause initial fragments of speech sounds to be omitted or clipped 
if break-in is occurring when the suppressor is operated. Informal listen- 
ing tests have shown that such front-end clipping is not very objection- 
able even when pickup times are as long as 20 or even 30 msec. 


3.5 Differential Sensitivity 


The differential sensitivity is a measure of the signal level at the even 
side required to operate the break-in detector in the presence of a signal 
at the odd side. A 1000-cps tone is applied to the odd terminal 3 db 
louder than the suppressor sensitivity. Then a 1000-cps tone is applied 
at the even side and is increased in level until break-in is detected. The 
difference in levels between the break-in signal and the odd signal is 
the differential sensitivity. 

For example, assume the OTL suppressor sensitivity is —32 dbm. A 
—29-dbm tone is applied at point 1 in Fig. 6. Break-in occurs when a 
—28-dbm tone is applied at the even terminal, point 2. The differential 
sensitivity is (—28) — (—29) db or +1 db. 

The negative threshold detector requires a fixed voltage difference to 
operate. Its threshold is set to establish the break-in sensitivity by ad- 
justing the amplifier on the even input, and the differential sensitivity 
is established by adjusting the indicated amplifier in the odd input. 
Because the threshold detector operates on a voltage which is the differ- 
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ence between the voltages at the even and odd inputs, if the differential 
sensitivity is 1 db for a —29-dbm signal at the odd input, the differential 
sensitivity will decrease for louder odd signals (but will still remain 
positive). For example, if the odd input is a tone of —15 dbm, the same 
voltage difference at the negative threshold detector will be required to 
overcome this signal, but the voltage ratio will be lower. This is why the 
definition of differential sensitivity must specify the magnitude of the 
test signal applied at the odd input. 


3.6 Break-In Hangover 


The break-in detector operates to remove suppression when the even 
speech is sufficiently louder than the odd speech. Because of this, it is 
apparent that loud speech at the odd input may prevent the break-in 
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detector from operating. In this case the echo suppressor reverts to the 
simple echo suppressor of Fig. 1, resulting in considerable speech mutila- 
tion. However, even when the odd speech volume is much greater than 
the even speech volume, because of the irregular nature of speech there 
are times when the even speech peaks will be sufficient to initiate break-in 
detector action and remove suppression. Shortly thereafter, the odd 
speech will once again be great enough to reinsert suppression. This 
alternate suppression and nonsuppression produces objectionable speech 
chopping. 

To overcome this difficulty, a hangover is usually introduced on the 
break-in device. Once the local talker has broken in, he does not have 
to depend entirely on the differential, since the hangover will maintain 
break-in. Also, if he is talking and speech arrives from the distant 
talker, his speech will not be cut off by the suppressor for at least the 
break-in hangover period. (However, in the circuit of Tig. 6 if the dis- 
tant speaker is talking and the local speaker breaks in, he does not 
have to wait for the suppression hangover to release before suppression 
is removed. The opposite is true in the 1A echo suppressor.) 

If the break-in hangover is very long, practically no chopping will 
occur, but the break-in detector will remain activated when the local 
speaker has finished talking. During many of these times, distant speech 
will be present, and if transmission is allowed on the even side at these 
times, echo will be returned. This also can be very annoying on long- 
delay circuits. 

The difficulties of achieving good break-in ability may now be sum- 
marized: 

1. To avoid false break-in, the differential circuit must be made 
relatively insensitive to signals at the even side because the expected 
echo levels can be fairly high. This will cause chopping of local speech. 

2. To avoid chopping, hangover is introduced on the break-in circuit. 
But this allows more echo to get through after the local speaker has 
stopped talking. 

The break-in hangover is usually adjusted to effect a compromise 
between excessive chopping and echo. A typical value for recent de- 
signs is 150-200 msec. This is much longer than the 18-msec break-in 
hangover of the 1A suppressor. If a very good break-in detector could 
be built, the hangover would not need to be as long, and a better com- 
promise between chopping and echo could be made. Also, if return 
losses could be improved (made greater), the existing break-in detectors 
could have even lower differential sensitivities, which would reduce the 
need for long hangovers. The accomplishment of either of these objec- 
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tives would be of considerable value in improving present-day echo 
suppressors. 


3.7 Asymmetry in E’cho Suppressor Environment 


Little attention has yet been given in this paper to the speech levels 
at the suppressors. If each speaker is a “‘normal’’ talker and has rela- 
tively little loss between his subset and the suppressor, his speech will 
appear at OTL at a long-term average level of roughly —15 dbm. 
There can be wide variations from this level. The speaker may be an 
especially loud or weak talker, or he may be prone (as many people are) 
to holding the transmitter under his chin, or there may be additional 
loss between him and the suppressor. 

Fig. 7 is an illustration of a long-delay circuit in which two loud 
talkers with equal speech volumes are connected. The losses L4 and Ly 
account for all the speech level variations due to the factors described 
above. If L4 equals Lg , then each speaker has the same ability to break 
into the other’s speech. Consider though, the case in which L, equals 
zero and Ly equals 10 db. Speaker B will have more difficulty than A 
in conversing for two reasons: 

1. The echo returning to A is 20 db less than that returning to B 
(assuming equal return losses at both hybrids). Therefore, B will hear 
louder echoes during break-in than will A. 

2. A has 10 db more difficulty in breaking in than B; consequently B 
will hear more chopping. 

If the losses L, and Lz differ by 10 db because of the actual circuit 
configuration, as may happen, the circuit is said to have 10 db asym- 
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Fig. 7 — Asymmetry in an echo suppressor connection; the difference in losses 
D4 and Lp is a measure of the asymmetry. 


2908 THE BELL SYSTEM TECHNICAL JOURNAL, NOVEMBER 1963 


metry. An asymmetry of only 10 db is sufficient to often cause one 
speaker to have considerable difficulty, whereas the other speaker may 
hardly notice anything wrong with the circuit. Differences in talker 
levels at the two ends, which may frequently occur, are also causes of 
asymmetry, much as if the physical losses differ. 


IV. TYPES OF ECHO SUPPRESSORS 


Four echo suppressors are described below to illustrate the various 
approaches taken to the problem of designing echo suppressors which 
permit double talking. All of the echo suppressors are similar in that 
they employ a suppression device and a differential circuit. However, 
each has some peculiarity which distinguishes it from the others. 


4.1 1A Hcho Suppressor 


This echo suppressor has been standard in the Bell System since the 
late 1930’s. About twenty thousand are presently in use on long distance 
continental and ocean cable circuits. 

A simplified schematic of the 1A echo suppressor (used as a full echo 
suppressor) is shown on lig. 8.* In the quiescent state with speech in 
neither the odd nor even path, the DA tube current flowing through the 
IM and OM relays is such that OM is operated and EM is not. If the 
tube current decreases, OM releases; if it increases, EM operates. 

Consider speech in the odd path. Part of this speech enters the odd 
amplifier via the hybrid,f is amplified and half-wave rectified, and is 
then applied to the cathode of the DA tube. This makes the cathode 
more positive with respect to the grid, which decreases the tube current 
and releases relay OM. Release of OM removes ground from one wind- 
ing of the OH relay and applies ground to the second OH winding, 
causing it to operate. Operation of OH places a ground return path in 
parallel with the EM relay, which prevents its operation and also re- 
moves ground from the hybrid balancing network N in the even path. 
Network N now balances the input impedance of the even amplifier 
and this provides a high (35 to 40 db) transhybrid loss across the hybrid 
in the transmission path which suppresses the echo. Suppression hang- 
over is supplied by the RC network in one winding of the OH relay. 


* The block diagram shown in Fig. 4 is for the split-terminal suppressor and is 
not a representation of the full suppressor of Fig. 8. 

{+ The echo suppressor hybrids are branching devices for transferring energy 
from the speech path to the suppressor circuitry and should not be confused with 
the terminating hybrids that convert the 2-wire line to a 4-wire line. 
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If speech is present in the even path, the operation is similar except that 
the DA tube current is increased and the EM relay operates, causing the 
EH relay to operate and place loss in the odd path. 

When speech is present in both paths during double talking, the echo 
suppressor operates as follows. Assume initially that speech in the odd 
path occurred first. This speech operates the OM and OH relays, sup- 
pressing the even path with a hangover of 50 msec. If the even speech 
level is less than the odd speech level, the suppression remains in. If 
the even speech level rises above the odd speech level for 50 msec, 
the OH relay releases and the EM and EH relays operate. This removes 
suppression in the even path but places suppression in the odd path 
with a hangover of 70 msec. For about equal-volume talkers, the sup- 
pression will alternate between the two paths as the syllabic or peak 
power points of the speech in both paths alternate. Almost all echo will 
be suppressed, but the speech during double talking will be chopped or 
mutilated. 

Operation in the manner described above occurs with a full suppressor. 
Split echo suppressor operation is obtained by permanently grounding 
the balancing network in the odd path, with the hangover on the EH 
relay set to its minimum value of about 18 msec. Two echo suppressors 
with this modification are used on one circuit — one at each end. The 
operation during double talking is similar to that of a full echo suppres- 
sor. 

The 35-40 db suppression supplied by the hybrid balance is sufficient 
for most circuits today, but the longer-delay cable circuits or future 
satellite circuits require more suppression than this. A modification 
(shown on Fig. 9) was made to the echo suppressor to increase the 
suppression for tests on long delay circuits. The OH relay, instead of 
balancing the hybrid, applies a short circuit across the even path to 
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Fig. 9 — 1A echo suppressor modified for greater suppression. 
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suppress the echo. Two split suppressors modified in this fashion were 
subjectively tested on long-delay circuits at Bell Telephone Laboratories. 
The results of the subjective tests are given in a companion paper.’ 


4.2 BH Echo Suppressor 


This echo suppressor, one of several experimental echo suppressors 
designed at Bell Telephone Laboratories, differs primarily from the 
1A echo suppressor in that its suppression and break-in functions are 
independent of each other, while in the 1A these functions are combined 
into one differential circuit. A simplified schematic of the BH split 
echo suppressor is shown on Fig. 10. 

If speech is present in the odd path, relay S operates and places a 
short across the even transmission path (provided relay B is operated) 
which suppresses the returning echo. Operation of suppression relay S 
is controlled by speech in the odd path only and is independent of 
speech in the even path. (This differs from the 1A where relay OM is 
controlled by speech in both paths as determined by the DA tube cur- 
rent.) Relay S provides a suppression hangover of about 50 msec. 

Speech in the even path is amplified, rectified, and applied to the grid 
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Fig. 10 — BH echo suppressor. 
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of tube D. The negative signal applied to the grid reduces the tube cur- 
rent which releases normally operated relay B. Release of B prevents 
or removes any suppression caused by operation of relay S and also 
removes ground to enable a speech compressor in the odd transmission 
path. (This is discussed later.) Once relay B has released, it remains 
released for a hangover time of 200 msec. 

When speech is present in both paths, during double talking, the oper- 
ation is as follows. Assume that speech in the odd path occurred first. 
This speech operates the S relay suppressing the even path with a hang- 
over of 50 msec. If the even speech level is less than the odd speech level, 
relay B stays operated, and the suppression remains in. Since the even 
speech may be the echo of the odd speech, this speech must not operate 
relay B to remove suppression. Echo is prevented from operating relay 
B by proper choice of gains in the odd and even amplifiers and the RiC; 
time constant. (This was discussed in Sections 3.1 and 3.3 above.) 
When true break-in speech of sufficient level is present in the even 
path, it will release relay B and remove the suppression for 200 msec. 
If relay S is operated when relay B releases, 6 db loss is inserted in the 
even path. Also, anytime relay B is operated, a speech compressor is 
enabled in the odd path. When the echo suppressor is used at OTL, the 
speech compressor is adjusted to supply 0 db loss for a —50-dbm signal 
in the odd path and to smoothly increase this loss to 12 db for a 0-dbm 
signal. The 6-db loss in the even path and the speech compressor in the 
odd path are used to reduce the echo occurring during the double 
talking interval, when relay B is operated and suppression is removed. 

If prolonged double talking is simultaneously present at the echo 
suppressors at both ends of the connection, all echoes would be attenu- 
ated by two compressors and two 6-db pads. This is not generally the 
case, however. Many echoes return at a time when the local talker is 
silent, and they are often reflections of speech generated when there 
was no distant speech. It is more accurate to say that all echoes are 
attenuated by at least one compressor and at least one 6-db pad. 


4.3 GN Echo Suppressor 


This split echo suppressor, also designed at Bell Telephone Labora- 
tories, is unique in that it uses variolossers for suppression and thus ap- 
plies a variable amount of suppression depending on the odd speech 
level. A simplified schematic is shown in Fig. 11.* 

Speech present in the odd path is amplified, rectified, and applied to 


* A more complete description of this echo suppressor is given in Ref. 4. 
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Fig. 11 — GN echo suppressor. 


the variolossers VL; and VL.. The combined suppression loss of VI, 
and VL», is shown on I'ig. 12. It is seen that for very low-level speech 
the suppression slope is moderate, but for higher-level speech the sup- 
pression has a steep slope and rapidly reaches 80 db suppression. By 
proper choice of odd and even amplifier gains and the time constant 
on the odd input to the trigger circuit, echo is prevented from removing 
this suppression. 

During periods of double talking, the operation is as follows. If odd 
speech is present first, suppression is applied. When break-in speech of 
sufficient amplitude then occurs, it is recognized as such at the input to 
the trigger circuit, which operates relay B. Operation of relay B removes 
the suppression of VL. to allow even speech to pass through the echo 
suppressor. It also reduces the suppression of VL; and places a 4-db loss 
in the odd path to reduce the echo. The suppression of VL; during double 
talking is shown on Fig. 12. Relay B provides a break-in hangover of 
about 400 msec. 


4.4 AM Echo Suppressor 


This echo suppressor, an experimental model by a United States 
manufacturer, differs primarily from other echo suppressors in its action 
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during double talking. A simplified block diagram of the AM echo sup- 
pressor is shown on Jig. 18. 

If speech is present in the odd path, threshold detector TD» operates, 
producing a signal passing through gate 1 and enabling the transistor 
suppressor, which has about 60 db loss. Suppression hangover (about 
100 msec) is supplied by hangover control HC. . 

Speech in the even path is amplified, rectified and applied to the differ- 
ential amplifier. If the output of the differential amplifier operates 
threshold detector TD; as described in Section 3.3, gate 1 is inhibited, 
preventing suppression. The inhibit signal has a hangover time of about 
200 msec. 

When speech is present in both paths during double talking, the oper- 
ation is as follows. Assume that speech in the odd path occurred first. 
This speech operates TD2 , suppressing the even path with a hangover 
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of 100 msec. Echo is prevented from operating TD, through use of 
proper time constants in the differential amplifier. When true double 
talking speech of sufficient volume is present in the even path, the 
differential amplifier will cause TD, to operate, removing suppression 
for at least 200 msec. TDs is still operated by speech in the odd path, 
and when TD; and TDz are operated, a 6-db loss is introduced in the 
amplifiers in the even and odd paths to reduce the echo. The operation 
of TD, also effectively inhibits a gate in the odd speech input to the 
differential amplifier. This inhibit signal prevents the release of TD, 
as long as the even speech is greater than the even sensitivity, irrespec- 
tive of the speech present in the odd path. Thus, as long as the even 
speech level does not fall below the even sensitivity for over 200 msec 
(the hangover time of HC), gate 1 will always inhibit suppression no 
matter what the level of the odd speech. This action reduces the mutila- 
tion of even speech but also increases the unsuppressed echo, as dis- 
cussed below. 

Suppose the odd speech maintains an average level 10 db higher 
than the even sensitivity and the echo of the odd speech is reduced by 
an 8-db return loss such that the odd echo is 2 db higher than the even 
sensitivity. This echo will not by itself remove suppression because of the 
comparison action of the differential amplifier. However, if a short 
burst of break-in speech is present in the even path, TD, will operate 
to remove suppression and inhibit gate 3. Now the echo of the odd speech 
is sufficient to maintain TD, on, and gate 1 will be inhibited until the 
echo of the odd speech falls below the even side sensitivity for more 
than 200 msec. This action has been observed to produce considerable 
echo for some combinations of talkers and return loss. 


V. CONCLUSIONS 


In this paper we have discussed some of the design features of a par- 
ticular type of echo suppressor. The philosophy behind the design of 
this type of echo suppressor is that a telephone channel should allow 
as much two-way conversation as possible consistent with proper echo 
control. Thus every effort is made to provide ease of break-in during 
double talking. Unfortunately, this generally results in a compromise 
between having too much echo or too much speech chopping during 
double talking. The echo and chopping are very disturbing to some 
people, and the disturbing effects may, in some cases, outweigh the 
beneficial aspects of attempting to provide a two-way circuit during 
simultaneous talking. 


ECHO SUPPRESSOR DESIGN 2917 


Ease of break-in is not the only principle which can be followed in 
designing echo suppressors. Other approaches have been suggested. 
For example, one echo suppressor has been proposed which is designed 
to train the conversants not to double talk by increasing the received 
volume of the distant speaker whenever the local speaker tries to 
interrupt, thereby ‘‘shouting down” the interrupter. Another proposal 
is aimed at preventing the chopping effect during double talking by 
allowing only one-way conversation at any time, the allowed direction 
being determined by examining which conversant spoke first. 

All of these proposals include voice switching, and all produce trans- 
mission degradations. It is impossible to predict the subjective reaction 
to the degradation introduced by these various proposed echo suppres- 
sors merely by examining the design features of the suppressors. This 
reaction can be determined only through subjective tests, one type of 
which is described in a companion paper. 
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Subjective Evaluation of Delay and Echo 
Suppressors in Telephone 
Communications 


By R. R. RIESZ and E. T. KLEMMER 


(Manuscript received August 9, 1963) 


The effect of transmission delay upon the quality of a telephone circutt 
was investigated using naturally occurring telephone conversations. Round- 
trip delays of 600 and 1200 msec went almost unnoticed at first when no 
echo sources or echo suppressors were present in the circuit. After exposure 
to pure delays of up to 2400 msec, considerable dissatisfaction, indicated by 
rejection of the circuit, developed with the 6GO00- and 1200-msec delays. 
When echo sources and echo suppressors were added to the circuit, some 
dissatisfaction developed immediately for round-trip delays of G00 and 1200 
msec. No adaptation to the effects of delay occurred; in fact dissatisfaction 
increased with experience under certain conditions. 


I, INTRODUCTION 


The preceding two papers describe the nature of the problems intro- 
duced into telephone circuits by delay and echo and the attempted 
solution of these problems through the design of echo suppressors. The 
present paper describes recent determinations of the degradation of 
transmission quality caused by pure delay and delay plus echo and echo 
suppressors. The studies reported were all done by the human factors 
research department of Bell Telephone Laboratories. 

Other groups have studied the effect of delay upon voice transmission. 
These include the research department of Bell Telephone Laboratories, 
Stanford Research Institute, Italian Telecommunications Administra- 
tion and the British Post Office. Unfortunately, none of this work has 
yet been published. In general, these studies have found little degrada- 
tion with pure delay, even for round-trip delays over one second, al- 
though some objection did occur in one experiment with natural con- 
versations with 1410-msee round-trip delay. These same subjects did 
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not find the delay objectionable during structured conversation, how- 
ever. 

All of the experiments described in the present paper were performed 
by introducing experimental circuits into naturally occurring telephone 
conversations. This method was chosen because of the previous finding, 
here and elsewhere, that the subtle conversational difficulties produced 
by delay and echo suppressor action do not often occur in structured 
conversations. 

The general plan of the study was first to evaluate the effect of pure 
delays and then evaluate the effect of echoes, echo suppressors and 
delays for several different echo suppressors described in a companion 
paper.! When it became obvious in the first two experiments that 
continued exposure to long delays had a marked effect upon the user’s 
reaction to shorter delays, a third experiment was conducted in which 
each subject was exposed for some weeks to only a single value of a 
given delay. 

The primary measure of transmission quality in the present studies is 
the percentage of calls on which the circuit was rejected as unsatis- 
factory by the users for normal use. In addition, for some of the calls 
made under each condition, the users were called back and asked about 
the circuit. 


II. SIMULATION APPARATUS: SIBYL 


The simulator called SIBYL, which permits the insertion of experi- 
mental circuits into existing telephone lines, has been previously de- 
scribed by H. D. Irvin? With this device it was possible to introduce 
controlled delay through the use of magnetic disc delay units (Echovox 
Sr.), echo suppressors and return loss to simulate field echo conditions. 
The simulator could be inserted into any call originated by any member 
of a panel of users without letting him know of its insertion. 


III. EXPERIMENT 1: DEGRADATION IN TRANSMISSION QUALITY DUE TO 
ADDED PURE DELAY 


3.1 Apparatus 


For the pure delay tests a complete four-wire network was employed 
as shown in Fig. 1. Artificial sidetone was provided, and loss and noise 
were adjusted to values representative of the standard circuit. None of 
the calls were monitored; that is, complete privacy of conversation was 
maintained. For each call through the simulator, the originating number, 
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Fig. 1 — Circuit used to introduce delay in experiment 1: (a) standard circuit 
between PBX subsets, (b) experimental circuit between four-wire subsets used to 
introduce transmission delay. 


called number and length of call were recorded as well as the time at 
which the call was rejected for the rejected calls. 


3.2 Subjects 


Eighteen members of the administrative staff at the Laboratories 
who called each other frequently were asked to serve as subjects. They 
were told that some of the calls that they originated would be routed 
over a simulated satellite circuit, but they did not know which calls 
would be affected or what the changes in the circuit were. They were 
instructed that if they found any circuit “unsatisfactory for normal 
telephoning” they could dial the digit ‘3’’ without hanging up or break- 
ing the connection, and the standard circuit would be restored. Only the 
calling party was able to reject the circuit. 


3.3 Test Conditions 


Because of the necessity for a complete four-wire connection, the 
delay was inserted only on calls in which one of the subjects called 
another. The simulator was inserted on every such call unless it was in 
use by another pair of subjects. Since the simulator could handle only 
one call at a time, when one pair of subjects was routed through the 
simulator, all other subjects would receive the standard circuit even if 
they called each other. 
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The experiment continued for 12 weeks. The general plan of the 
experiment was to alternate between delays of 600 and 1200 msec each 
working day for 12 weeks. This schedule was changed in the fifth and 
sixth weeks, during which the 1200-msec delay was alternated each 
day with 2400-msec and the 600-msec delay was not used at all. The 
reason for inserting the 2400-msec delay in the fifth week was that 
there were almost no rejections during the first four weeks, which 
raised the question as to whether the subjects were not having any 
difficulty due to the delay or were not associating unusual conversa- 
tional difficulties with the telephone circuit. It was correctly assumed 
that the 2400-msee delay would produce identifiable circuit-related 
difficulty. All but two of the 18 subjects made or received at least one 
call over the 2400 msec circuit during the two weeks it was used. The 
final six weeks at 600- and 1200-msec delay show the users reaction to 
these lesser delays after exposure to the longer delay. The trend during 
this six-week period is particularly important. The negative reaction of 
the users to the longer delay may be expected to generalize to the shorter- 
delay calls which are made in close time proximity. That is, after ex- 
posure to the 2400-msec delay, the users may reject any circuit on which 
they note any characteristics of a delay circuit. If this is the only effect, 
then the rejection rate should jump suddenly after exposure to 2400 
msec and then return (perhaps slowly) to the former level when the 
2400-msec condition is removed. 

If, on the other hand, exposure to the longer delay teaches the users 
to identify conversational difficulties which are also present at shorter 
delays, then the rejection rate should not fall after removal of the longer 
delay. Indeed, it may well be expected to continue to rise as the learn- 
ing continues. 


3.4 Results 


The percentage of calls rejected for each two-week period of the 
experiment is plotted in Fig. 2 for each delay separately. The grouping 
of weeks by twos provides an average of 29 calls per point with a mini- 
mum of 16 calls, whereas single weeks would have as few as six observa- 
tions per point. None of the calls at 600 msec delay were rejected in the 
first four weeks and only 3 of 50 calls at 1200 msec were rejected by two 
different subjects. The 2400-msec delay was rejected more than half the 
time during weeks 5 and 6 when it was employed (18 of 34 calls). In 
the two weeks following exposure to 2400-msec delay there were still 
no rejected calls at 600 msec, but in the final four weeks 138 subjects 
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Fig. 2— Percentage of calls rejected as a function of weeks for each pure delay 
condition separately. Combined data from 18 subjects, all of whom were exposed 
to all delays. Number of observations per point varies from 16 to 37. The 2400- 
msec delay was used during weeks 5 and 6 only. Rejections on standard circuit 
calls were less than 1 per cent. 


made 61 calls at 600 msec and four of these subjects rejected a total of 
17 calls.* The 1200-msee delay showed an increasing rejection rate 
after week 5, and during the last four weeks, 17 subjects made 66 calls 
at 1200 msec and eight of these subjects rejected a total of 31 calls. T 

Because the circuit difficulties introduced by delay are not apparent 
until after some conversation has taken place, it is of interest to ask 
how long the call has been in progress before rejection occurs. The 
median time to dial out the simulator for all rejected calls was 22 sec- 
onds. The median length of rejected calls was 133 seconds, including 
talking time both before and after rejection. For comparison, the 
median length of nonrejected calls was only 94 seconds. Thus, rejected 
calls tend to be longer calls and they are rejected fairly early in the call. 
Lest this finding be interpreted as implying that users are more likely 
to reject calls early that they anticipate will last long, analysis of later 
data shows that such is not the case. 

The rejection rates of the delayed calls may be compared to the 
probability of rejecting a call on the standard telephone circuit. In the 
pure delay experiment it was possible to record attempted rejections of 
standard circuits by the subjects. These numbered 86 in 6688 calls, or 
less than one per cent. 


* These four subjects rejected 1 of 6, 2 of 12, 5 of 11, and 8 of 22 calls respec- 
tively. 

+ These eight subjects rejected 1 of 2, 1 of 5, 2 of 2, 2 of 3, 2 of 7,3 of 4, and 18 
of 21 calls respectively. 
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3.5 Discussion 


The extent to which the 2400-msece exposure, actually experienced 
by 16 of the 18 subjects, influenced the over-all results has not been 
established. It cannot be assumed that the high rejection rate at 1200 
msec would ever have been reached if 2400 msec had not been introduced. 
The rejection rate at 600 msec has undoubtedly been influenced by the 
exposure to 1200 and as well as 2400 msec. The importance of such 
interaction effects was not appreciated until after the more extensive 
tests at experiments 2 and 3. Regardless of the probable influence of the 
2400-msec exposure and the 600-1200 mixture on the results, it is signifi- 
cant that the subjects show no sign of becoming accustomed to delay 
and adapting their conversations to its presence. 

Individuals vary widely in their reaction to delay. One of the test 
subjects rejected 27 of the 36 calls he made at 1200 msec. Since the 
remaining 17 subjects rejected 17 of their 118 calls over the 10-week 
period at that value of delay, it can be seen that this one subject had an 
important effect on the total results. Of course, such strong objectors 
cannot be ignored in planning communications systems. 

Because of the large differences in calling rate and rejection rate among 
the subjects and the small number of calls which could be sampled, the 
absolute levels of rejection rate shown in Fig. 2 cannot be taken as 
representative of larger populations of calls and users. The qualitative 
finding of increasing rejection rate with experience with the delays used 
is clear, however, for several of the subjects who must be assumed 
representative of a significant proportion of the total population. 


3.6 Summary 


Pure delays up to 1200 msec, round-trip, added to a telephone circuit 
did not result in much user dissatisfaction with the circuit for users with 
limited experience in using such circuits. 

After further experience, which included limited exposure to 2400- 
msec delay, considerable dissatisfaction developed for delays of 600 
msec and 1200 msec when these were intermixed. 


IV. EXPERIMENT 2: DEGRADATION IN TRANSMISSION QUALITY DUE TO 
DELAY, ECHO AND ECHO SUPPRESSOR (INTERMIXED DELAYS) 
4.1 Apparatus 


The arrangement of the SIBYL simulator for experiment 2 is shown 
in Fig. 3. It differed from experiment 1 in that normal 2-wire connections 
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were maintained between the telephone sets and hybrid transformers. 
Return loss* was set at 12 db; about 15 per cent of normal long-distance 
connections have a return loss and echo worse than this value. 

The four echo suppressors employed are described in the preceding 
article by Brady and Helder.! They will be designated here, as in the 
previous article, by 1A, BH, GN and AM. Delay was produced by 
Echovox, Sr. magnetic disc units in the 4-wire portion of the circuit. 

For each call through the simulator, the originating number, called 
number, length of call, and time of rejection (if rejected) were recorded. 


4.2 Subjects 


The subjects were 101 employees of the Murray Hill Laboratories 
representing a wide cross section of occupations and ages. None of the 
subjects were working on transmission quality or satellite projects. 
Because some of the subjects shared a telephone, there were only 94 
telephone lines. Any call originating from these telephones could be 
routed through the simulator regardless of its destination. This is 
different from experiment 1, in which only calls between subjects could 
receive the experimental circuit. 


4.3 Instructions 


As in experiment 1, the subjects were not informed as to the nature 
of the degradation which would occur, nor were they cued as to which 
calls were routed through the simulator. They were instructed that 
some of the calls they originated would be routed over a simulated 
satellite circuit, and that if they found any such call unsatisfactory for 
normal telephoning they could restore the standard circuit merely by 
dialing a ‘‘3” without hanging up or breaking the connection. 


4.4 Test Conditions 


In addition to the four different echo suppressors and several delays, 
several experimental circuits were employed which involved adding 
loss or noise to an otherwise standard circuit. This was done to estab- 
lish a relation between percentage of calls rejected and the more conven- 
tional types of circuit degradation. 

On any one day of the experiment only one echo suppressor and four 
delays were employed; on other days the four loss conditions or the 
four noise conditions were employed. There were seven different daily 
conditions and four values of the appropriate parameter within each 
daily condition, as shown in Table I. 
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Taste I — Description oF TrEst ConpitTions USED 
IN EXPERIMENT 2 


Suppressor 1A 50 200 600 1200 msec delay 
Suppressor BH 50 200 600 1200 msec delay 
Suppressor GN 50 200 600 1200 msec delay 
Suppressor AM 50 200 600 1200 msec delay 


Noise 26 32 38 44 dbrn noise* 
Loss I m 6 9 12 15 db added loss 
Loss 2 6 10 15 21 db added loss 


* The noise was a recorded mixture of thermal noise and power hum selected 
to be representative of actual noise on telephone circuits. It was measured at the 
line terminals of the calling subject across his 500-type station set using a Western 
Electric 3A noisemeter with C-message weighting. 


The main experiment lasted for eight weeks. On three days of each 
five-day week, three of the echo suppressors were used; on one of the 
remaining days the noise conditions were used, and on the other day 
the loss conditions were used. The order of suppressors, noise and loss 
was varied so that each of these occurred on different days each week. 
In addition, the sequence of values within days was varied so that each 
value occurred at a different time each day. The values within days 
were changed every twenty calls. Thus, the first 80 calls each day were 
assigned to the four experimental values given in Table I in some 
previously determined order. On most days 100 calls were made through 
the simulator, the final 20 calls being over a standard circuit to provide 
a basis of comparison. As an example of the schedule, suppressor 1A 
might be used Monday with the first 20 calls at 600 msec delay, with the 
second 20 calls at 200 msec, with the third 20 calls at 1200 msec, and 
with the fourth 20 calls at 50 msec; the final 20 calls might use the 
standard circuit. On Tuesday, the four noise conditions might be in- 
serted. 

Whether or not any particular subject received the simulator circuit 
when he placed a call was dependent upon whether or not the simulator 
was in use. It could handle only one call at a time. The other controlling 
factor was that no subject was given two calls on the simulator in im- 
mediate succession. 

Echo suppressors BH and GN were used for the entire eight weeks of 
the experiment. Suppressor AM was not available until the fifth week, 
at which time it was substituted for suppressor 1A, which had been 
used along with BH and GN during the first four weeks. 

Loss 1 (6-15 db range) was employed the first two weeks, during 
which time it became clear that the range was too small, whereupon 
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loss 2 (6-21 db) was substituted for the final six weeks. The noise condi- 
tions were used one day a week for all eight weeks. 

Following the eight weeks of the main part of the experiment, an 
additional three weeks were run in which only one suppressor and one 
delay was used for the entire week. This was done in order to see if 
users would learn to adapt to the circuit changes with more practice 
under one condition. During the last of these additional weeks (week 11) 
the subjects were called back after each rejection and asked about the 
reason for rejection. After the eleventh week, the subjects were in- 
formed that the experiment was over. 

Table II reviews the plan of the entire experiment by weeks. 


TABLE IJ — PLAN oF EXPERIMENT 2 BY WEEKS 


Week of Experiment 





1 2 3 4 5 6 7 8 9 10 11 
1A suppressor | AM suppressor BH at {GN at /BH at 
600 600 600 
BH suppressor msec msec msec 


only only only 
all all all 
week week week 





GN suppressor 





Loss 1 | Loss 2 
Noise 


(For variation within days during weeks 1-8, see Table I. Each horizontal row 
of Table II represents one weekday.) 


4.5 Results 


The percentage of calls rejected at each delay is shown in Table III 
for each suppressor separately. There is little difference among the four 
suppressors. Suppressor AM shows a slightly higher rejection rate than 
the other suppressors, but it was tested only during weeks 5-8 and the 
difference may well be due to an increased sensitivity after training on 
the delay circuits (suppressor BH showed exactly the same number of 
rejects as suppressor AM during weeks 5-8). A three-way analysis of 
variance confirms that there was no statistically significant difference 
in rejection rate among the four suppressors.* Therefore, the data from 
all suppressors are pooled in Fig. 4, which shows the rejection rate as 
the first eight weeks of the experiment. 

"_* Differences between suppressors can be demonstrated by special techniques 
other than natural conversation, as has been shown by M. B. Gardner and J. R. 


Nelson.’ Their acceptability in natural conversations would seem to be the ulti- 
mate criterion, however. 
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Taste II] — Resection Rate ror Eacu SUPPRESSOR 
AND DELAY 


Round-Trip Delay in msec 


Suppressor 50 200 600 1200 All Delays 
1A* 5 15 25 39 21 
BH 7 11 24 41 21 
GN 4 17 22 38 20 
AMt 9 20 36 40 26 
All suppressors 6 15 26 39 


Combined data for first eight weeks of experiment 2. Each table entry is based 
on 80 calls for suppressors 1A and AM and 160 calls for suppressors BH and GN. 
Table entries are percentage of calls rejected. 

* Tested during weeks 1-4 only 

+ Tested during weeks 5-8 only 


The standard condition is shown on Fig. 4 for comparison. Of 766 
calls over the standard circuit, 10 (1.3 per cent) were rejected. 

Fig. 5 shows the rejection rate for each value of added loss and the 
standard circuit with no added loss. All subjects are combined over the 
eight weeks during which the loss conditions were run. 

Fig. 6 shows the rejection rate for each value of noise for all subjects 
and weeks combined. The standard circuit for internal calls had a 6-dbrn 
noise level and is plotted at that point. 
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Fig. 4— Rejection rate as a function of delay: combined data from 101 sub- 
jects using four echo suppressors for a total of eight weeks. All subjects were 
exposed to all delays and suppressors; each point represents 480 calls, except for 
the standard circuit point, which represents 766 calls. 
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CALLS REJECTED IN PER CENT 





ADDED LOSS IN DECIBELS 


Fig. 5 — Rejection rate as a function of added loss: combined data from 101 
subjects. The 9- and 12-db added-loss conditions were run during the first two 
weeks only, and represent 40 calls each. The other points represent 120 or 160 
calls each, except for the zero added-loss point (standard circuit), which repre- 
sents 766 calls. 


4.6 Effect of Experience 


The above figures summarize the data over the first eight weeks of 
the experiment and thus leave unanswered the question about change 
in rejection rate with experience. Do the circuits with delays, echoes and 
suppressors show an increased rejection rate with exposure, as shown in 
Tig. 2 of the pure delay tests? Evidence on this question is presented in 
Fig. 7, which shows rejection rate as a function of weeks of the experi- 
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Fig. 6 — Rejection rate as a function of noise level: combined data from 101 
subjects. Noise was a mixture of thermal noise and power line hum measured 
across the terminal set by a Western Electric 3A noise meter with C-message 
weighting. Each point represents 160 calls, except the 6 dbrn point, mes gebre 
sents 766 calls; the 6 dbrn point is the standard circuit condition. 
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Fig. 7 — Rejection rate as a function of weeks of experiment: data from 101 
subjects using all echo suppressors in use each week. Each point represents 120 
calls, except points for weeks 9-11, which represent 268 or 480 calls each. Only 
the 600-msec delay condition was used on weeks 9, 10 and 11. 


ment for each delay condition separately. The data from all suppressors 
are combined since there was no meaningful difference among suppres- 
sors. The changes in rejection rate over weeks shown in I'ig. 7 are quite 
erratic, but for each delay above 50 msec the rejection rate was higher in 
the second half of the experiment than in the first half. The rejection 
rate at 50 msec actually decreased with weeks, but it should be pointed 
out that none of the changes with experience are statistically significant 
in this experiment. 


4.7 Interview Results 


The results of calling the subjects back after they had rejected a 
circuit is shown in Table IV. This procedure was followed only in week 
11, and included 70 calls. Chopping and echo rank first among types of 
annoyance, which is to be expected from the action of the suppressor. 
Noise and loss comments do not refer to experimentally added noise and 
loss, since these conditions were not employed in the eleventh week. 
The echo suppressor does put in loss during double talking and its ac- 
tion does interrupt line noise originating at the other end of the circuit, 
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TasLe ITV — REsponsEs oF SUBJECTS TO INTERROGATION CALL 
IMMEDIATELY FOLLOWING THEIR REJECTION OF THE 
CIRCUIT DURING WEEK 11 (BH Suppressor, 

600 MSEC DELAY) 


Why did you reject the circuit? 


Echo 47% 
Chopping 36% 
Noise 21% 
Low volume 11% 
Delay 13% 
Would the circuit have been acceptable for a transatlantic call? 
No 51% 
Marginal 28% 
Yes 21% 





A total of 70 interviews was made. 


facts which may account for at least some of the loss and noise com- 
ments. 

Table IV also shows the percentage of responses of several types to 
the question as to whether the circuit they had rejected would be accepta- 
ble for transatlantic calls. (They had been instructed to reject circuits 
unacceptable for normal telephoning.) Almost 80 per cent of the re- 
spondents said the quality would be unacceptable or marginal for trans- 
atlantic service. 


4.8 Length of Calls 


The relation between length of call and re‘ection rate was investi- 
gated by combining the data from all suppressors and delays of 200 
msec or more. For 532 calls lasting 30 seconds or less the over-all rejec- 
tion rate was 8 per cent. For 391 calls lasting 2.56 minutes or more the 
over-all rejection rate was 51 per cent. Another way of looking at the 
relation between length of call and rejection is to consider the length 
of rejected and nonrejected calls. Nonrejected calls had a median length 
of about one minute. Rejected calls have a median length of about 2.5 
minutes and have a median time of rejection of about half a minute for 
the delay conditions. The probability of rejecting a call at any point 
was looked at as a function of the total length of the call. There was no 
relationship; that is, subjects do not show a tendency to quickly dial 
out calls which they know will last long. 

Calls going outside the Murray Hill Laboratory* are longer on the 





*The breakdown of calls by destination was: Murray Hill extension, 62 per 
cent; tie lines, 23 per cent; outside local, 14 per cent; DDD, 1 per cent. 
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average than calls to inside extensions and show a higher rate of rejec- 
tion. There is no meaningful indication that the change in rejection rate 
for outside calls is either more or less than would be expected from their 
longer duration. 


4.9 Discussion 


In interpreting the results of experiment 2 it must be kept in mind 
that all subjects were potentially exposed to delays up to 1200 msec 
each week.* As in experiment 1, we must suspect that this exposure 
influenced the rejection rate at the lesser delays; experiment 3 provides 
data on this point. However, exposure of telephone customers to delays 
up to 1200 msec is not inconceivable in future satellite systems, and the 
present experiment — which gives the average user one such call every 
two weeks — is perhaps a reasonable approximation to one such system. 

Two facts should be borne in mind in extrapolating from the labora- 
tory experiments to field usage. One is that the return loss in the labora- 
tory tests was set at a level somewhat worse than the average field 
expectation. Return losses in operating systems would be expected to 
be equal to or worse than the laboratory value in only about 15 per cent 
of all calls. The other factor is that the average laboratory call is much 
shorter than the average long-distance call; our results show that longer 
calls are rejected at a much higher rate. Thus, the laboratory rejection 
rates cannot be applied directly to all field calls, but do indicate poten- 
tial difficulty in a significant proportion of long distance calls with 
intermixed delays such as those in experiment 2. 


4.10 Conclusions 


(z) Intermixed delays from 200 to 1200 msec produce substantial 
rejection rates under the conditions of the present experiment. 

(77) There was no meaningful difference among the four echo sup- 
pressors tested. 

(zit) There is no evidence of increased tolerance of delay with ex- 
perience nor on long distance calls. 

(iv) Rejection rate increases greatly with length of call. 


* Those people who made many calls did talk over the maximum delay each 
week. Many subjects who used their phone little did not talk over the maximum 
delay each week, but they do not influence the rejection rate as much, either. 
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V. EXPERIMENT 3: DEGRADATION IN TRANSMISSION QUALITY DUE TO 
DELAY ECHO AND ECHO SUPPRESSOR (FIXED DELAYS) 


In the previous experiments, delays up to 1200 msec were intermixed 
each week so that all subjects were exposed to the entire range of delays. 
Experiment 3 modified the procedure of experiment 2 in that no subject 
ever talked over a longer delay than that assigned to his group, and 
separate groups of subjects were exposed to maximum delays of 50, 
200, 400 and 600 msec. 


5.1 Apparatus 


The SIBYL simulator arrangement was the same as that used in 
experiment 2, except that only one echo suppressor was used (suppressor 
BH). Fig. 3 of experiment 2 shows a block diagram of the simulator 
which applies equally well to experiment 3. 


5.2 Subjects 


Eighty employees of the Murray Hill Laboratory served as subjects. 
None had served in any previous simulator experiment. During the 
course of the experiment 24 subjects changed rooms, left or otherwise 
became unavailable. Their data was removed from the analysis. 


5.3 Instructions 


The instructions to the subjects were the same as in experiment 2: 
that is, they were told to dial out any circuit which they found unsatis- 
factory, but they were not told the nature of the degradation and were 
not cued as to which calls were routed through the simulator. 


5.4 Test Conditions 


During the first week of the experiment all subjects were exposed to 
the 50-msec delay condition only. That is, on any call they initiated 
they would receive the simulator with a 50-msec delay unless another 
subject was already using the simulator. After the first week the 80 
subjects were divided into four groups of 20 each, matched approximately 
on calling rate as determined from the first week. The sequence of delays 
to which each group was exposed is shown in Table V. Note that sub- 
jects in each group were never exposed to delays longer than that 
assigned from the fifth through fourteenth weeks. 

Because the delay could not be changed instantaneously when any 
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TABLE V — SEQUENCE OF DELAYS FoR HACH GROUP OF 
EXPERIMENT 3; SUPPRESSOR BH was Usep THrouGHour 
Week of Experiment 


Group 1 2 8 4 5 6 7 8 9 10 I1 12 18 14 #15 16 


1 50—————+ 600 -—-—_______________5 
2 50 200-— —_________—__________» 
3 50 200 400 —————_______________—_—_» 
4 50 200 600 — i _—_—_—____—___—_—_—_———_ 


Table entries are round-trip delay in msec. 


particular subject picked up his telephone, it was necessary to divide 
the 8-hour day into four 2-hour periods and make the simulator, with 
appropriate delay, available to only those subjects who were assigned 
the same delay. The sequence of conditions within each day was rotated 
in such fashion that each group of users had access to the experimental 
circuit at a different time each day. Of course, they were not informed 
of this scheduling. 

Group 1, which had the 50-msec delay for the first four weeks, was 
changed to 600 msec on week five. Since so few calls at 50 msec were 
rejected, it was not considered necessary to continue the 50-msec 
condition beyond week four. Group 4 was changed to 600 msec in order 
to increase the number of subjects at this delay, which is perhaps the 
most important one for questions of satellite communications. In order 
to combine groups 1 and 4 for an equivalent number of weeks at the 
600-msec delay, data from two additional weeks (weeks 15 and 16) 
were taken from group 1. 

The noise and loss conditions of experiment 2 were not used in experi- 
ment 3. Another difference from experiment 2 was that no control was 
exercised over the number of calls made each day under each of the 
experimental conditions. Rather, a time period was set aside each day 
for each condition, as previously described. 

As in the previous experiments, the data recorded consisted of calling 
number, called number, length of call, and time of rejection if any. 

During the eighth and fourteenth weeks the subjects were called back 
immediately after.rejecting a circuit and also were called on a sample of 
simulator calls which were not rejected. They were asked what, if any- 
thing, they had noticed about the circuit on the previous call. 

Finally, after the above schedule had been completed, the subjects of 
groups 1 and 4 were given one week of mixed delays of 200, 600 and 1200 
msec in a fashion similar to experiment 2. This was done to check the 
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possibility that differences in results between experimgnts 2 and 3 
might have been due to a chance difference in subjects. 


5.5 Results 


During the first week of experiment 3, when all subjects were on the 
50-msec delay, 73 of them made at least one call over the simulator. A 
total of 475 calls were made, of which eight, or less than 2 per cent, were 
rejected. Group 1 continued on 50 msec through the fourth week. During 
those four weeks they made a total of 376 calls, of which two, or less 
than 1 per cent, were rejected. In view of the low rejection rate and the 
fact that no calls at all were rejected during the third and fourth weeks, 
group 1 was changed to 600-msec delay on the fifth week. 

The 200-msec delay condition was given to all subjects except group 1 
during the second week and to group 2 only from the third through 
fourteenth weeks. During the second week 13 of 265 calls, or less than 5 
per cent, were rejected. 

The main body of results from experiment 3 is shown in Fig. 8, which 
plots rejection rate as a function of delay. In the solid curve, different 
subjects are represented at each delay; that is, each subject appears only 
at the delay value which was the maximum to which his group was 
exposed, except for group 1, which appears in both the 50-msec and 600- 
msec conditions for the main part of experiment 3. This combined group 
was also given the added week of mixed delays, shown by the dashed 
line. Fig. 8 shows a low rejection rate at 200-msec delay (2 per cent), 
less than 8 per cent rejection at 400-msec delay and 14 per cent rejection 
at 600-msec delay. These figures may be contrasted with the rates from 
experiment 2 shown in Tig. 4 — which are roughly double these values — 
and the rejection rates for mixed delays during the one-week extension 
of experiment 3, which are only slightly below those of Fig. 4. 


5.6 Effect of Experience 


Experiment 1, with pure delays presented in an intermixed fashion, 
showed a large increase in rejection rate with continued experience. 
Experiment 2, with echoes and suppressors added, showed little increase. 
The results of experiment 3 are shown in Fig. 9, which plots average 
rejection rate as a function of weeks of experience at maximum delay. 
The data from the 200-, 400- and 600-msec delay groups are plotted in 
this figure. Data from all groups are combined in such fashion that the 
first point plotted represents the first two weeks each group was at its 
maximum delay (not including the added week of mixed delays to 1200 
msec). 
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Fig. 8 — Rejection rate as a function of delay. Each point represents separate 
groups of 12 to 25 subjects, except that the 50-msec group (group 1) was later 
given 600-msec delay and is included with the subjects initially assigned to 600 
msec (group 4). The light dashed line shows the result of exposing this combined 
group (1 and 4) to the mixed delays of experiment 2 for one week. Each point of 
es eae represents 376-774 calls. Each point of the dashed line represents 

-68 calls. 


Tig. 9 shows a trend toward increasing rejection rate with experience 
on fixed delays, but as in experiment 2 the increase is small compared to 
the pure delay condition in experiment 1. 


5.7 Interview Resulis 


The results of calling the subjects back after they had made a call 
over the simulator are shown in Table VI. This table is in two parts: 
one for the case in which the simulator call had not been rejected and one 
for rejected calls. If the call had not been rejected, more than half the 
subjects noticed nothing different about the circuit; other comments 
were spread over many categories, echo being the largest. If the call 
had been rejected, half the subjects reported objectionable echo and all 
other categories of comments were higher than for the nonrejected calls. 
As in experiment 2, the suppressor will introduce loss during double 
talking which will subjectively make the circuit sound low in volume, 
and it will sometimes chop and mutilate echo, room noise and even 
speech in such a way as to sound like line noise. It is very difficult for 
the user to identify the exact nature of difficulties with those circuits. 


5.8 Destination and Length of Call 


The findings of experiment 2 on destination and length of call were 
substantiated in experiment 3. Long calls are rejected at a higher rate. 
Outside calls are longer and are rejected more than inside calls. 
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Fig. 9 — Rejection rate as a function of weeks at maximum delay. Data from 
all groups of experiment 3 combined as of the start of their respective weeks at 
the maximum delay to which each was exposed (200, 400, 600 msec). The added 
ee of mixed delays to 1200 msec is not included. Each point represents 47 to 163 
calls. 


VI. GENERAL DISCUSSION AND SUMMARY 


The effects of transmission delay upon the quality of a telephone 
circuit are not as obvious to the user as the more conventional degrada- 
tions such as noise, Joss and distortion. Indeed during most of a conver- 
sation over a delayed circuit there is no degradation. When degradation 
does occur it can often be misinterpreted by the user as being due to the 
other speaker. Slow responses, excessive interruptions and complete 
failures to respond (because the question was lost in the circuit) are 
examples of such difficulties. 

It is not surprising, therefore, that in more than half the conversa- 
tions involving pure delays of 1200 msec, or 600 msec plus echo and 
suppressors, the users are not aware that there is anything different 
about the circuit.* 

In the pure delay condition the effect is particularly subtle since echo 
and speech mutilation are absent. Experiment 1 shows that it takes 


* That there is an effect, however, is indicated by the fact that the users tend 
to talk for shorter times over these circuits. The median length of call over the 
600-msee circuit with echo and suppressors is 24 per cent shorter than call length 
on the standard circuit. 


SUBJECTIVE EVALUATION 2939 


Taste VI — RESPONSES OF SUBJECTS TO INTERROGATION 
CALL IMMEDIATELY FOLLOWING A CALL OVER 
THE SIMULATOR 


Rejected Calls 
Why did you reject the circuit? 


Echo 58% 
Chopping 10% 
Noise 22% 
Low volume 18% 
Delay 10% 


Nonrejected Calls 
Did you notice anything different on your last call? 


Noticed nothing 52% 
Echo 23% 
Chopping 7% 
Noise 0 

Low volume 18% 
Delay 11% 





Only data from 600-msec delay (suppressor BH) are included, since rejection 
rates and therefore sample size at lesser delays were too small to be meaningful. 
Percentages are based on 40 rejected calls and 56 nonrejected calls. 


considerable experience, and experience with long delays, before the 
effects of pure delay are recognized and rejected. Because of the limited 
scope of experiment 1 it is not possible to say exactly how much pure 
delay can be tolerated in telephone transmission, but it is clear that 2400 
msec is completely unacceptable even on the first few calls and that 
intermixed 600 msec and 1200 msec are both rejected at fairly high rates 
by users who have been sensitized to delay circuits. 

When echoes and echo suppressors are added to the circuit and delays 
up to 1200 msec are intermixed, even the 200-msec delay shows up with 
a significant rejection rate (15 per cent). 

It might be thought that users would be more tolerant of long distance 
calls than internal calls and therefore that the rejection rates shown in 
all three of the present experiments are pessimistic. Analysis of the long 
distance calls made in experiments 2 and 3 does not support this view. 
In both cases the long distance calls were rejected at a significantly 
higher rate than internal calls. Although the difference in rejection rate 
can probably be attributed to length of call, long distance calls in 
operating systems average longer than those in the present study, and 
overseas calls are considerably longer on the average. 

Experiment 3 showed that the use of fixed maximum delay conditions 
for each subject decreases the user dissatisfaction with circuits of shorter 
delays compared to the intermixed situation. When separate groups of 
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subjects were exposed to different values of maximum delay, the group 
exposed to 200 msec rejected only 2 per cent of the calls, and even at 600 
msec the rejection rate was only 14 per cent, compared to 26 per cent in 
experiment 2, in which delays up to 1200 msec were intermixed. 

Individual users’ reactions to delay, echo and suppressor action dif- 
fered widely. Among each group of subjects there was at least one who 
never rejected any of the simulator circuits. At the other extreme, the 
most critical subject had a rejection rate three to five times the group 
average. There was a slight tendency for the heavy users to be more 
critical, but this is likely due entirely to the increase in rejection rate with 
experience. 

There was no meaningful difference in rejection rate among the four 
echo suppressors tested. None offered a measurable advantage over a 
modified version of the suppressor most commonly used today. 

linally, what do the results of these experiments mean for interna- 
tional communications? From the present experiments it would seem 
safe to conclude that noticeable degradation in transmission quality 
may occur under some circumstances with delays as low as 200 msce 
and with any of the currently available echo suppressors. The circum- 
stances include low return loss and users who have been sensitized to 
the problems created by transmission delay. 

Degradation increases with delay until at 1200-msec round-trip delay 
and any present echo suppressor, more than one-third of the calls were 
rejected as being unsatisfactory under the conditions of these experi- 
ments. An increase in circuit loss of approximately 20 db was required 
to produce the same rejection rate. 

The degradation due to transmission delay in any telephone trans- 
mission system must be weighed against the cost and other degradations 
inherent in alternative transmission systems of less delay. It is clear, 
however, that the influence of delays of 200 msec or more should be 
considered in designing international voice communications systems to 
carry natural conversations. 
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Calculation of the Spin-Axis Orientation 
of the Telstar Satellites from 
Optical Data 


By D. W. HILL 
(Manuscript received July 17, 1963) 


The orientation of a satellite can be calculated if the aspect of two or more 
known reference vectors can be measured with respect to a body-fixed co- 
ordinate frame. On the Telstar satellites one such line may be determined by 
solar aspect cells, but the other is determined by ground observation of 
flashes of sunlight reflected from three mirrors mounted on the spinning 
satellite. The simple geometric considerations are discussed, as are the 
optimum placement of the mirrors, the amount of data to be used, and an 
error analysis. Spin-axis orientation data are used as the basis for steering 
the spin axis by ground command of the current in a coil of wire around the 
equator of the satellite. 


I. INTRODUCTION 


A novel method was devised to determine the orientation in space of 
the Telstar satellites. The method used grew out of limitations on weight, 
power, and telemetry facilities, and consists of three highly reflective 
mirrors and six solar aspect cells attached to the outside of the satellites. 
The mirrors reflect sunlight to a ground observation station,* and the 
solar aspect cells measure the direction of the sun with respect to refer- 
ence coordinates in the satellite. The method has proved successful in 
determining the orientation of the Telstar satellites with sufficient 
accuracy both to deduce the residual magnetic moments of the satellites 
and to correct their precession by means of an equatorial torque coil. 
The remainder of the discussion concerns the details of the attitude 
determination scheme. See Ref. 1 for a description of the photoelectric 
equipment used. We will restrict the discussion to a spin-stabilized 


* The use of mirrors for this purpose was first suggested by D. Gibble of Bell 
Laboratories. 
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satellite and further assume that the precessional motion occurs at a slow 
rate. 

The orientation of a satellite can be calculated if the directions of two 
known lines in inertial space can be measured with respect to a body- 
fixed reference frame. One such reference line can be measured by solar 
aspect cells, and, in the case of a spin-stabilized satellite, another can 
be determined by observing flashes of sunlight reflected from a mirror 
attached to the satellite. Because of the indeterminancy of the angular 
displacement about each reference being observed from the satellite, 
we know only that any given body axis must lie on a cone about the 
reference line. The semivertex angle of the cone is the angle between the 
body axis and the reference line, and is the quantity determined by the 
attitude sensing scheme. Consequently, if two reference lines are known 
with respect to the satellite, the orientation of any desired body axis 
must lie along one of the two lines of intersection of the two cones which 
have been determined. Since the inertial coordinates of the reference 
lines are known, the orientation of the body axis can be calculated. 
Any ambiguity between the two possible orientations can usually be 
resolved by the general knowledge of the observations, and in fact, no 
difficulty was experienced with the Telstar satellites in resolving the 
ambiguity. 


1.1 Mirrors Attached to Satellite 


There are three mirrors attached to the satellite, one being a plane 
mirror approximately 4 X 6 inches in size mounted so the normal to 
the mirror makes an angle of 68° with the symmetry or spin axis of the 
satellite; see Fig. 1. The other two mirrors are each approximately half 
the size of the first and both are mounted so the normal is inclined 95° 
to the spin axis. The two 95° mirrors are 120° apart in aximuth around the 
satellite and provide a coding of the flash train to enable the ground 
station to determine which mirror is flashing. 

If a train of flashes of reflected sunlight is observed at the earth and 
the mirror which is flashing can be identified, we know that the spin 
axis must lie on a cone of either 68° or 95° semiangle about the line 
normal to the mirror at the instant of a flash. This line can be calculated, 
since it lies in the plane of the sun, satellite, and ground station and 
bisects the angle between the incident and reflected rays. 

The angles 68° and 95° were chosen because mirrors placed at those 
angles would afford nearly the maximum number of flash observations 
for the nominal orbit and attitude. Calculations showed that these 
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Fig. 1 — Placement of mirrors. 


two mirrors would provide an average of more than one flash observa- 
tion per night, weather permitting, for the first 55 nights, after which 
there would be no night passes for about one month due to precession of 
the satellite orbit and movement of the sun in the geocentric frame of 
reference. 


1.2 Solar Aspect Determination 


The determination of the sun’s aspect is made on the satellite by six 
solar cells placed on the nearly spherical surface of the satellite. These 
solar cells are placed symmetrically and oppositely pointed on each of 
three orthogonal axes, Fig. 2. Each cell responds to sunlight over one 
hemisphere — 1.e., 27 steradians — giving a current response approxi- 
mately proportional to the cosine between the direction of the sun and 
the axis of the solar cell. Thus no more than three cells are illuminated 
at any one time, and the direction of the sun with respect to the orthog- 
onal triad through the solar cells is found by relatively simple calcula- 
tions using the currents generated by the three illuminated cells. 

The orthogonal triad through the solar cells is located symmetrically 
with respect to the spin axis of the satellite, three axes piercing the upper 
hemisphere of the satellite and their opposite extensions piercing the 
lower hemisphere, the solar cells being attached at the intersections of 
the axes and the shell of the satellite. Thus the three axes in either 
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Fig. 2— Solar aspect cell geometry. 


hemisphere each have a direction cosine of +/ 1/3 with the extension of 
the spin axis in that hemisphere. 

Once the aspect of the sun is determined with respect to the orthog- 
onal triad through the solar aspect cells, the angle between the satellite- 
sun line and the spin axis is thus easily computed. 

In principle this solar aspect scheme is satisfactory, but it does suffer 
from an inherent defect. Fundamentally, the method is an analog method 
depending on absolute values of solar cell current for solar aspect 
determination. Thus any drift, either in the properties of the cells or in 
the data amplifying and transmitting system, will cause inaccuracies in 
the aspect determination. In order to minimize these effects, the cells 
have been preirradiated to lessen the effect of radiation in the satellite 
environment. Further, thermistors are used to measure the temperature 
of the cells and a temperature correction is applied to the solar cell cur- 
rent readings. Finally, two methods of computing the solar aspect are 
used to try to isolate any discrepancies that arise. However, there is no 
way to arrive at a correct aspect determination if the solar cells give 
erroneous readings. The best that can be achieved is to obtain some 
average result by distributing errors in some rational manner, such as 
distributing them in proportion to the currents measured in the’ three 
illuminated cells. ; Mee | 

Consideration was given to correcting for angular rotation of the 
satellite during the time required to read the solar aspect cell currents. 
The six cell currents are measured a nominal 100 microseconds apart, 
so at 3 revolutions/second the satellite rotates 0.54° from first to last 
reading. However, calculations showed that the solar aspect cell data 
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would need an accuracy in excess of the capability of the solar aspect 
cells and data encoder to show these effects. Therefore, no spin correc- 
tion was included in the analysis. 


II, DETERMINATION OF SPIN-AXIS ORIENTATION 


In this section we want to consider the problem of determining spin- 
axis orientation from observational data which give the directions with 
respect to the spin axis of two or more known reference lines. One of 
these reference lines will be the line normal to a mirror attached to the 
satellite at the instant a flash of reflected sunlight is observed at the 
ground station, and the other will be either a second such reference line 
or the line from the satellite to the sun. Since the change in the satellite- 
sun line is very slow, two solar aspect observations are not very useful 
for determining spin-axis orientation, as the two cones would nearly 
coincide and hence yield results of questionable accuracy. Clearly, any 
pair of observations could be used for spin-axis determination, but for 
useful information on the spin-axis precession the observations should 
be made as close together in time as possible. There would be no advan- 
tage in studying the use of three or more observations simultaneously, 
as the three or more cones would in general not have a common inter- 
section because of observational errors. Thus the observed data should 
be used in pairs for determination of spin-axis orientation and any 
averaging should be done by operating on the resulting direction vectors. 


2.1 Reference Line Obtained from Flash Observations 


Let the inertial reference frame used be the x, y, z frame, and let the 
associated unit vectors be 7, j, and &, respectively. At the instant a 
flash of sunlight reflected from the mirror attached to the satellite is 
seen at the ground station, the sun, satellite, and ground station co- 
ordinates in the 2, y, 2 frame can be obtained by well-known methods. 

Let the vector R, from the earth mass center to the sun be (see Fig. 3) 


R, = xt + yd + ach. 

The vector R, from the earth mass center to the ground station is 
Ry = aot + yt + 20k 

and the vector R; from the earth mass center to the satellite is 


R, = Ut + Yo) + abe 
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7 MIRROR 





(R5-Rp =V,) 
SUN 


GROUND 
STATION 







GEOCENTER 
Fig. 3 — Reference frame for calculations from mirror flashes. 


The vector R; in the direction of the light ray incident to the mirror is 
R, = R, — R,. 

Since the solar parallax is only a few seconds of are even from the 
satellite, we could approximate R; by —R,, but this approximation 
will not be made, as very little simplification results. 

The vector F, in the direction of the reflected ray is 

R, = R, me Ry . 

Let us adopt the symbol & to denote a unit vector, and the symbol 

R to denote the absolute value of a vector. The incident and reflected 


unit vectors are 











R; = 24+ Yih + zik 
R, = at + yd + 2k 
where 
vy — Xs Yo — Us &b — &s 
i= Ui => ra = 
% Bee Re R: 
Ly — & 1 — Yo 2g — &b 
= a an ae a ra 
1 1 
R; c= [(ae — Le)" a (yp pas Ys)” + (2 7, 26)" : 


1 1 
~y R. E os Re (asty + Yeo + zas) | 
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R, = [(2y — 2)” + (yy — yo)? + (2 — %)F 
= [Ry + Re — (apr + yoyo + 2020)]' 
R, = [ve + ye + 2,7], distance from geocenter to sun 
R, = earth radius at ground station 
Ry = lee ty + ze |} distance from geocenter to satellite. 


The vector V, normal to the mirror is thus given by the expression 
V; = R, = R; 
since V,; must bisect the angle between the incident and reflected rays 


and be normal to the mirror. 
If we let 2, y1, 2: be the direction cosines of the unit vector V;, we 


can write 


Vi = %t+ Yj + ak 











where 
, Ur 7 Ui 
vw = 

Vi 
Vp — Yr 

YS V 

1 
er Fr Kang 

41 = V, 


Vi = Uae — 2)? + Ce — a)? + Ge — 2) 
= R 2( tpt; ci Uri ig z,2i)]'. 


The vector V; is our desired reference vector and can be computed 
explicitly by the above formulas for any time at which a flash is ob- 
served. 


2.2 Second Reference Line 


We will let our second reference line be represented by the unit vector 
V2 
Vo = tet + yof + zeke. 


The vector V2 could represent the satellite-sun line in the case of a 
solar aspect determination, it could represent a second reference line 
determined from a flash observation, or it could be any other reference 
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line whose orientation is measured with respect to the satellite body 
axes as well as in the 2, y, z system. 


2.3 Spin-Axis Orientation 


First we let V; be the unit vector in the direction of the spin axis. If 
we observe a flash from the satellite and can identify the mirror which 
reflected the flash, then we know the angle between the normal to the 
mirror and the spin axis, say a. Similarly, if we determine the sun’s 
aspect with respect to the spin axis or if we observe a flash from a second 
mirror, we know that the spin axis makes an angle, say 6, with our 
second reference line. These two conditions give us sufficient information 
to determine the direction of the spin axis. If we know the spin axis 
makes an angle a with V; and an angle 8 with V2, the spin axis must 
therefore lie along one of the two intersections of a cone of semiangle a 
about V, and a cone of semi-angle 8 about V2. In general we do not 
know which intersection is the correct location of the spin axis, but we 
should be able to resolve the ambiguity by the general knowledge of the 
situation. 

We can express the above statements analytically by the following 
set of equations 


Vi-V3 = cosa = x3 + YYs + 212s (1) 
VieVs = COS B = Vor3 -+- Y23 -}- 2923 (2) 
V3: V3 =l= te ae Ys" + os (3) 
The solution of (1)—(8) is found to be 
A B 
v3 = aah (4) 
—(AB + A’B’) + [(AB + A’B')’ 
a > (A? i C* aie A”)(B + C + B”)} (5) 
ie Be + (2 + B? 
/ / 
Ae A ey) (6) 
C 
where 


A = 2CO0S a — 2, cos B 
B= Ayo — 241 


A’ = 22.008 a — 2; cos 8 
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, 
B! = xXyY2 — Loy 
CS 20%] — 2%. 


Given the direction cosines of the two reference vectors V, and V; 
plus a and 8, we can use (4)-(6) to calculate the direction cosines of 
the spin axis. The plus and minus square root term in (5) is due to the 
two possible locations of the spin axis. 


III. CONSIDERATION OF ERRORS 


It is of interest to know the effect of observational errors on the 
calculated orientation of the spin axis. In the following the significant, 
sources of error are discussed and the effects of these errors on the 
orientation of the spin axis are calculated. 

Since we know the positions of the sun, satellite, and ground station 
to within a small fraction of a degree of arc, we can reasonably assume 
that the direction of V2 is known with negligible error and that the 
uncertainty in V; is a fraction of a degree that does not exceed the errors 
in the observations themselves. Our main sources of observational error 
are in the determination of the aspect angles a and 8, so we will consider 
the effect of small errors in a and @ on the orientation of the spin axis. 
The principal errors we can expect in @ and 8 are 

1. Error in measuring the angle a between the spin axis and the 
normal to the mirrors. This is a mechanical error of approximately 
-+1/2°. There is also the possibility of some wobble in the spin motion, 
although the nutation damper should minimize wobbling. 

2. Error in determining the direction cosines of the sun with respect 
to body axes. This gives an error in the other cone semi-angle, 8. This 
error has been about -++1/4° on most passes, but has been considerably 
worse on others, possibly due to interference from earth-shine. Since the 
sun subtends an angle of 1/2° at the satellite, not much increase in 
accuracy can be expected with the solar aspect system. 

3. Error in determining the center of the sun’s image on the ground, 
iLe., ascertaining the time of the center of the train of light flashes. This 
error has the effect of changing the direction of the reference vector V, 
by + 21/4°, since the semi-angle the sun subtends at the ground station 
is 1/2° to 3/4°, depending on the flatness of the mirrors. We will simplify 
our error analysis by considering the effect of this error to be an addi- 
tional error in a. 

4, The angular errors of V; resulting from the position ephemeris for 
the satellite would enter into the error analysis in a fashion similar to 
item 3. 
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3.1 Analysis 


First let ao and By be the values of the cone semiangles that determine 
the true orientation of the spin axis, and let Vo be the unit vector in 
the direction of the true spin axis. Similarly, let a and 6 be the observed 
values of the cone semi-angles that determine the orientation of an 
estimated unit vector V. We will assume that a and @ differ only slightly 
from ay and & , respectively, so that V and Vp differ only slightly. Once 
we have determined the reference vectors V, and V» and the cone semi- 
angles a and 8, the orientation of the spin-axis unit vector is calculated 
from (4)—(6). We are assuming that V; and V2 are known with negligible 
error and are considering only the effect on orientation of errors in a 
and 6, so %, Yi, 2, and 2%, Y2, 2 are fixed quantities. The two unit 
vectors corresponding to the true and observed spin-axis orientation are 
thus 


Vo(ao, Bo) = tot + Yoo + zeke 
Via, B) = ct + yf + ek. 


We let 5V be the small vector difference between V and Vo, so we 
can write 


V=V,+ av. 


For small changes of a and 8 we can approximate 6V by the first- and 
second-order terms of a Taylor series. 











V = Vo + 6V 
, , OV av 10°V ; 
~ = sot Fyfe eae 6 
Yo a Oa a=ag oe F 0p a=ay p as 2 0a? a=ag “ 
B=8o B=Bo B=Bo 
10°V 2 , OV 
-_~ — 68° —_—— 6a 6p. 
TD pet ww aos wee 
B=B8o B=Bo 








The small angle between V and Vo, denoted e, is found from 


V-Vo = COS € 
sO 
, (av | av 
1+ Pol tt tH ben) 





B=8o B=Bo 


OPTICAL ATTITUDE DETERMINATION 2953 


eV ‘ av 


ba? + — — 
, + a=ag 0a 0p 
B=8o 


1. (av 
wei 0p? 
B=Bo 











a=ag 
=P0 


+ 0(da°, 58°) 


= COS € 
é 4 
= 1 ~ 39 + 0 (e ). 


The effect of variations in 6a and 68 on cos « is of second order, so the 
term containing first derivatives of V in (7) vanishes, as can be readily 
verified. Thus for e we find 


, (Vv 
2 
ove — 0. ( SF 


The reference vectors V; and V2 define a plane, so there is no loss of 
generality if we choose a & axis in the V2 direction and let V; lie in the 
&, 7 plane where we transformed from the 2, y, z axes to & 7, ¢ axes. 
The details of the transformation are unnecessary, since we seek only 
magnitudes of error in orientation and not direction. If é , é, and é; 
are the unit vectors in the &, 7, ¢ directions, respectively, then 


qa 
56° + 2 aV 


av 
ba” + — 
7 ss a=ag Oa 0g a=ag 
B=Bo B=Bo 


, 0B 


=a9 
B=8o 


da 68. (8) 











Vi = cosy & + siny & 
Vs = é 
where 
Vi: V2 = COS Y 


i.e., y 18 the angle between the reference vectors V, and V;. We infer 
from (1)-(3) that the unit vector V in the direction of the spin axis 
has the direction cosines 


—& = cos8 
_ coSa — cos y cos B 
ak sin y 
275 
ae a — cos? p — (208 — 0087 008) ) 
sin y 


where a and @ are the angles between the spin axis and V; and V2, 
respectively. 
After rewriting (8) with the help of &, , ¢ we obtain for « 
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sin a sin 
€ —_ % 0 Bo 5 (9) 
| nes (x Y — COS ao Cos Pe) 


sin ao sin Bo 


Sak — 2 (oo82 — COS ap COS ®) ba 88 + 58° 
2 —— 


provided 


COS Y — COS ap COS Bo 


: : = ae (10) 
sin ao sin Bo 


The restriction (10) merely assures that the two cones have two 
intersections, 1.e., 


vy > | ao — Bo| 
y¥ <ao + Bo. 


Vanishing of the denominator in (9) occurs for the singular instances 
when the two cones intersect in one line only or when they coincide. 
In these situations no solution exists if either ao or Bo is varied, and so 
the expression for ¢ is not defined. 

There are four examples which are of interest, and these are examined 
next. 


3.2 Solar Aspect and Mirror Observation 


Usually the two reference vectors determined will be the satellite-sun 
line and the normal to the mirror, angles a and 8 being the angles be- 
tween these reference lines and the spin axis. 
~ Let V2 be directed toward the sun. The satellite is intended to be 
aligned nearly perpendicular to the ecliptic plane, so we will set By = 
90°. Equation (9) becomes 


2 
€ = 


ba 0 ba 06 eae? 
Sin ao 


2 
=) 
SIN. ao 
and since there are two mirrors on the satellite having normals which 


are inclined 68° and 95° to the spin axis, a) can take on only the values 
68° and 95°. The variation in e with changes in y is displayed in Fig. 4. 


€ 
\dal 


» THE ERROR IN THE SPIN AXIS NORMALIZED 


-- OA, = 68, Bo =90° 

—j\—-— AX, = 68", By =95° 
\——|--&_ = 95°, Ao = 90° 
\—-—-\-- &_ = 95°, Bo = 95° 
\ -\ —+-—%p = 68%, Bo = 68° 


Ho =685 89=68°— 
Xo = 685 Bo= 95*-— 
Xo = 685 9 = 90°— 
Oo =95°, By =95°— 
X= 95°, By = 90°— 


a 


WITH RESPECT TO J& AND Jf 
Lh 
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Fig. 4 — Variation of geometric errors as a function of y. 
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3.3 Two-Mirror Observations 
The three possible combinations of two mirror observations are 
ay = 68°, Bo = 68° 
ay = 68°, Bo = 95° (or vice versa) 


ay = 95°, By = 95°. 


The results of varying y for these three possible combinations, among 
others, are also shown in Fig. 4. 
The curves of Fig. 4 show that if the angle between V; and V4 is be- 
tween 50° and 130° then 
lel < +2 Vea? + 58°. 
If 6a = 68 and 50° < y < 130°, then 
e < 2.828 da. 


Thus if the observational errors in a and 6 are 1°, the error in deter- 
mining spin-axis orientation would be less than 2.828° provided , the 
angle between V; and V2, satisfies the above inequality. 


IV. RESULTS 


The motion of the spin axis of the Telstar satellites has been derived 
from observations of flashes of sunlight reflected from the mirrors 
attached to the satellite and from use of telemetry data from the solar 
aspect cells mounted on the satellite. Figs. 5 and 6 and Table I sum- 
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Fig. 5 — Motion of spin axis in geocentric coordinates. 
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Fig. 6 — Motion of spin axis in polar coordinates referred to ecliptic plane. 


marize the early precession of the Telstar I satellite.* The portion be- 
tween passes 16 and 72 covers the period of time the torque coil on the 
satellite is known to have been activated (presumably from orbits 65 
thru 70). 

The accuracy obtained to date appears to be much higher than was 
originally anticipated. It is estimated that the angular error in deter- 
mining the spin-axis orientation is less than 1/2° for all observations 
except pass 199. The orientation determined for pass 199 is questionable 
due both to poor geometric relations and to many inaccurate frames of 
telemetry data. Some more detailed information is given below for two 
of the fixes reported. 

No high-frequency precession or ‘‘coning”’ has been detected so far, 
so apparently the nutation damper is operating satisfactorily. 

* Right ascension is measured eastward from vernal equinox inthe earth’s equa- 
torial plane; declination is measured plus or minus from earth’s equatorial plane; 


latitude is measured plus or minus from ecliptic plane; longitude is measured 
eastward from vernal equinox in the ecliptic plane. 
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Taste J — Sprn-AXIS ORIENTATION 














i Date Ces Pesce Declination? Latitude® Longitude? 
7, 8,9 7/10 0849 81.96 — 65.57 —86.60 13.22 
16 T/12 0214 84.39 — 65.81 — 87.62 15.52 
72 7/18 0553 86.86 — 66.10 — 88.67 18.55 
135 7/25 0328 91.22 — 65.86 —89.19 125.95 
199 8/1 0345 93.41 — 64.58 — 88.18 139.87 
272 8/9 0341 100.08 — 64.51 — 85.36 158.43 











Sensitivity checks have shown that timing mirror flashes to the nearest 
second is accurate enough. 


4.1 Attitude Calculations 


The times at which mirror flashes have been observed are the basis 
for attitude calculations; early observations for the Telstar I satellite 
are listed in Table IT. 


4.1.1 Passes 7, 8, 9 


Two mirror flashes were observed on cach of passes 7, 8, and 9, thus 
providing a fix for cach of these passes. The accuracy of these three 
fixes was not sufficiently great to permit determining the motion be- 
tween passes, so all three were averaged to obtain a single fix. The 
motion between successive passes is about 0.03°, probably below the 
precision of the present methods to resolve, although additional efforts 
were made on the data for passes 7, 8, and 9 to try to obtain such pre- 
cision. 

The results of various combinations of mirror flashes are given in 
Table JII to show the scatter obtained. This exercise demonstrates that 


TaBLE IJ — Mirror F'LAsH OBSERVATIONS 

















Observation No. Pass No. Date beh ee Mirror Observed 
MI C 7/i1 0235 + 26 68° 
M2 7 7/1 0250 + 47 68° 
M3 8 7/11 0518 + 08 68° 
M4 8 7/ll 05385 + 36 68° 
M5 9 7/\1 0817 + 22 68° 
M6 9 7/11 0821 + 30 95° 
M7 16 7/12 0214 + 19 68° 
M8 72 7/18 0553 + 26 68° 
M9 135 7/25 0328 + 07 68° 
M10 136 7/25 0609 + 31 68° 
Mil 199 8/1 0345 + 37 68° 
M12 272 8/9 0340 -+ 58 68° 
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TaBLE III — Resutts ror Passes 7, 8, AND 9 





Mirror Observations Right Ascension® Declination® 
M1 & M2 82.09 — 65.73 
M3 & M4 82.59 — 65.70 
M5 & M6 81.00 — 65.23 
M1 & M5 82.97 —65.74 
M2 & Md 81.27 —65.25 
M3 & M6 81.77 —65.77 





adequate attitude determination is possible from mirror data alone when 
solar aspect readings are unavailable or considered unreliable. 


4.1.2 Pass 135 


Mirror flashes were obtained for each of passes 135 and 136 and were 
used to obtain fix number 1 given in Table IV. Additionally, all the solar 
aspect data for pass 135 was combined with the mirror flash for pass 135, 
and 17 selected aspect readings averaged to obtain fix number 2. The 
average of these two was used for the pass 185 attitude fix. 


4.1.3 Remaining Observations 


All other attitude fixes reported here were obtained by combining one 
mirror observation with all the solar aspect telemetry data for the pass 
during which the mirror was observed. 


4.2 Residual Magnetic Moment 


The average residual magnetic moment has been determined approxi- 
mately by comparing the observed spin axis precession against theoretical 
predictions. The value obtained is 0.5 ampere-turn-meter?, which is of 
the same order of magnitude as the pre-launch estimates. 


V. CONCLUSION 


The conclusion drawn from these results is that it clearly is possible 
to track the motion of the spin axis with mirrors and solar aspect cells, 


TasLeE ITV — Resutts For Pass 135 








No. Right Ascension® Declination® 





1 91.10 — 65.80 
2 91.33 — 65.97 
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and that controlling the spin-axis orientation slowly by means of a 
torque coil is feasible. Much more extensive tests of the procedure 
have been performed than described here, which cover the remaining 
life of the Telstar I satellite and the Telstar II satellite. The computa- 
tional procedure has been modified so that no data from the solar aspect 
cells are required. In general, the resulting attitude angles fall well 
within the error estimates given in Section 3.8. A report on this work is 
being prepared by L. C. Thomas. 
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On the Spectrum of Optical Waves 
Propagated through the Atmosphere 


By D. C. HOGG 
(Manuscript received July 25, 1963) 


It is well known from our day-to-day experience that distant objects 
appear to shimmer, especially on still, hot days when relatively large 
temperature and humidity gradients may exist. It is true, however, 
that refractive index gradients are in the air to some degree at all times; 
that is, they may be present at night and also when the air is disturbed 
by the winds. It is not surprising therefore that the power received at 
some distance from an optical source fluctuates randomly and possesses 
a characteristic low-frequency power spectrum. 

The purpose of this note is to discuss typical low-frequency spectra 
resulting from propagation of 0.63-micron radiation over a 2.6-km path. 
A vertically polarized helium-neon maser! of power output 10 mw and 
a reflecting telescope of 9-cm diameter comprise the source. With all 
modes of the maser propagating, the beam spreads to an ill-defined 
and ever-changing disk of about 25-cem diameter.* The receiver is a 
refracting telescope of 5-cm diameter with associated filters, polarizers, 
and attenuators which feed a photomultiplier; it is located in the dense 
central region of the transmitted beam. The beamwidth of the receiver 
is large compared with that of the transmitter. For measurement of the 
power spectrum, the output of the photomultiplier is taken to a wave 
analyzer whose bandwidth is 4 eps. 

A typical power spectrum is shown in Fig. 1, the measured points 
being indicated by open circles.f The abscissa, F = f — f., 1s sideband 
frequency, and the ordinate indicates relative received power. On this 
scale the direct-current level is 133 units; thus.the power in the lower 
frequency components is about 13 db below the de level. The curve 
has been extrapolated from 10 cps to 1 cps. These data were obtained 
with the full aperture of the source telescope illuminated. _ 

At this point it is well to note that when the source is brought within 


* Dependent upon weather conditions. 
+ Similar data, using an incoherent source, were reported by V. I. Tatarski 
in Ref. 2. 


2967 


2968 THE BELL SYSTEM TECHNICAL JOURNAL, NOVEMBER 1963 
































RELATIVE RECEIVED POWER,P(F) 





























4 6 810 20 40 60 100 200 400 600 1000 
FREQUENCY, F,IN CYCLES PER SECOND 


Vig. 1 — Low-frequency spectrum: A = 0.63 yw for a 2.6-km path length. 


the laboratory housing the receiver and the same measurement procedure 
is followed, no such spectrum is observed. 

Three theoretical distributions, P(F), shown as curves in Fig. 1, 
accompany the experimental data. Curve A is a Gaussian, B an ex- 
ponential; in C, P(F) is of the form (1 + KF?)-1. In all cases, the theo- 
retical curves have been fitted to the experimental data at F = 30 eps, 
where the spectrum has fallen to one-half its very-low-frequency value. 
The exponential appears to best represent the data. 

Fig. 2 shows the effect of change in source beamwidth on the width 
(and shape) of the spectrum. For curve 2, the source beamwidth is 
about ten times that for curve 1. One notes that the half width increases 
from 40 to 120 cps with that increase in beamwidth. From this measure- 
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Tig. 2 — Low-frequency spectrum: A = 0.63 » for a 2.6-km path length. 


ment we deduce that the fluctuation rate is roughly proportional to 
the square root of the beamwidth. It is interesting that the two spectra 
appear to coalesce at the higher frequencies; however, in interpreting 
these data, one must recall that the near field of the source extends well 
toward the receiver in the case of curve 1. 

Of course, when one listens to the detected envelope of the optical 
wave, it is heard as low-frequency audio noise. 
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A Relation Between the Basis 
Functions of Periodically Varying 
Nondissipative Circuits 


By SIDNEY DARLINGTON 
(Manuscript received August 7, 1963) 


This note relates to circuits of linear, periodically time-varying, 
positive capacitors and inductors. More specifically, it notes a property 
of the basis functions, or natural modes which describe the transient 
behavior of the circuits. Some other properties of the basis functions are 
described in Ref. 1 in this issue. 

In accordance with Ref. 1 


IT=(S+ pCp)?, lL = pe | (1) 


where J and £ are column matrices, or vectors representing the excita- 
tion currents into and the response voltages at the various nodes, and 
® is an auxiliary vector variable. The matrices C and S are the node 
matrices of the capacitances and of the reciprocals of the inductances. 
They are symmetric and at least positive semidefinite. We will assume 
that C is positive definite. If it is not so originally, a similar equation 
with a positive definite C can be derived from (1), for example by the 
transformations described in the Appendix of Ref. 1. The symbol p 
represents differentiation (not frequency) and operates on all quantities 
following it. 

Setting J = O in (1) makes it a homogeneous, second-order, vector 
differential equation in, say n dimensions, with periodically varying 
coefficients. Thus the well-known Floquet-Poincaré theorem requires 
the solution to be as follows 


2n 
f= De Kee Yo = H,(te*’. (2) 
poe o=1 


Here the k,’s are arbitrary scalar constants and the g,’s are the basis 
functions, or natural modes. The characteristic exponents s, are scalar 
constants. The coefficients H, are time-varying vectors. When the 
s.’s are all different, the H,’s vary periodically. Otherwise, they are at 
most polynomials in ¢ with periodically varying vector coefficients. 
The real parts of the s,’s indicate the damping of the basis functions. 
When the inductors and capacitors are fixed, all basis functions are 
undamped. However, when the components vary periodically some of 
the basis functions may have nonzero damping (as is well known). 
Some bounds on the damping are derived in Ref. 1. It is also known that 
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the sum of all the characteristic exponents must be zero (when the 
circuit is nondissipative). 

The purpose of this note is to point out the following: Corresponding 
to any nondissipative circuit of pertodically varying positive inductors 
and capacitors, any characteristic exponents which have nonzcro real parts 
occur in equal positive and negative pairs. Since complex exponents must 
occur in conjugate pairs, it follows that all the s,’s must fit into pairs or 
quadruplets of the following sorts: 


+ Lay, +a; +a, + toy, (3) 
—twy,? —a;’ +a, — ton 

— Aap, — LW, 

— Ap, + Lap, » 


The theorem follows at once from certain general properties of ad- 
jointly related differential equations, which are stated below without 
derivation. The solution of the nonhomogeneous equation (1) may be 
expressed in terms of functions of the excitation time, 7, and the re- 
sponse time, ¢t. The adjoint equation is the equation whose solution 
corresponds to interchanging the two times, 7 and ¢. It can be shown 
that the characteristic exponents of the adjoint equation are the nega- 
tives of those of the original equation. It can also be shown that equa- 
tion (1) is self-adjoint. (The equation is its own adjoint.) Thus the 
negatives of the characteristic exponents of the original equation are 
also the characteristic exponents of the original equation. 

When the components are fixed, vectors J and F/ are related by a set 
of odd rational functions of a frequency variable. Then @, whose deriva- 
tive is /, is related to J by even rational functions. It appears that, more 
generally, self-adjoint differential equations are useful counterparts, 
for time-varying circuits, of even rational functions in the theory of fixed 
circuits. (See also Ref. 2.) 

Recall Foster’s canonical fixed nondissipative one-ports, comprising 
series- or parallel-connected subcircuits of one or two components each. 
The grouping (3) suggests that there may be a time-varying counterpart, 
in which the most complicated subcircuits correspond to quadruplets 
of + complex characteristic exponents. However, although the existence 
of the grouping (8) is necessary for such a configuration, it does not in 
itself prove, or even support a strong conjecture, that the configuration 
is, In fact, a canonical periodically varying nondissipative one-port. 
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A 5-Gigacycle Tunnel Diode 
Oscillator with 9-Milliwatt 
Output from a Single Diode 


By M. V. SCHNEIDER 


(Manuscript received September 24, 1963) 


This brief paper describes a tunnel diode oscillator which has been 
built by using a single diode in a low-impedance strip transmission line. 

Tunnel diodes were made from p-type zinc-doped gallium arsenide 
with a doping level of 7 X 10" carriers per cubic centimeter. Peak cur- 
rents in the range of 150-300 ma are obtained by alloying tin pellets 
with a diameter of approximately 1 mil to the gallium arsenide wafer. 

A new type of mount has been used in order to reduce the package 
reactance as well as the contact resistance of the diode in the transmission 
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Fig. 1 — Tunnel diode mounted in balanced strip transmission line. 


B.S.T.J. BRIEFS 2973 


TABLE I— DIMENSIONS OF BALANCED STRIP TRANSMISSION LINE 





Ground plane spacing b = 6 mils 
Width of center conductor w = 100 mils 
Thickness of center conductor t = 1 mil 
Distance diode to RF short A = 100 mils 
Impedance of strip transmission line Z = 2.8 ohms 





line. The mount is shown in Fig. 1. A hole with an 8-mil diameter is 
made in the 1-mil center conductor of the strip transmission line. The 
gallium arsenide wafer is alloyed to a 8-mil gold wire containing 3 per 
cent zinc. The diode is inserted into the hole, and the pellet is soldered 





Fig. 2 — Complete tunnel diode oscillator. 


to the tinned edge of the center conductor. A small drop of epoxy is 
used to lock the diode into the hole. The gold wire connects the upper 
and lower parts of the ground plane after the line is assembled. 

It is advisable to have a radio-frequency short close to the diode in 
order to prevent the oscillator from multimoding. The distance between 
the diode and the RF short determines the frequency of oscillation. The 
short shown in Fig. 1 consists of a symmetrical step in the ground plane. 
The center conductor is insulated from the ground planes by an evapo- 
rated oxide film or a thin sheet of mica. This section serves at the same 
time as a bypass capacitor for the de supply. The bias resistance consists 
of a thin nichrome film evaporated on Mylar. 

The dimensions of a particular assembly are given in Table I. 

A special line transformer is used for matching the low-impedance 
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TABLE JJ —IiLEctTRicAL PARAMETERS OF GALLIUM ARSENIDE 
TUNNEL DIODE 





Peak current Ip, = 210 ma 
Valley current I, = 12 ma 
Peak voltage Ey = 225 mv 
Valley voltage E, = 560 mv 
Junction capacitance C = 12 pf 











structure into a 50-ohm coaxial line. The impedance transformation is 
obtained by increasing the ground plane spacing b from 6 mils to ap- 
proximately 300 mils. This type of transformation leads to a structure 
with relatively low losses. A broadband launcher is used for the transis- 
tion from balanced strip transmission line to coaxial line. 

The complete oscillator is shown in Tig. 2. An output power of 9.1 
mw at 4.85 ge was obtained from a single diode. The electrical param- 
eters of the diode are listed in Table II. 

The junction capacitance has been measured in the valley by using 
the techniques developed by D. IE. Thomas.! Several attempts were 
made in order to determine the series resistance of high-current diodes 
in a transmission line with the dimensions given in Table I. It was found 
that the DeLoach method? gives very satisfactory results for relatively 
large ground plane spacings (10 mils and above); however, measure- 
ments at smaller ground plane spacings are at the present time not 
possible because of increasing line losses and because of launching prob- 
lems. | 

Several oscillators were built with diodes having peak currents in 
the range from 150 ma to 250 ma. The output power ranged from 6 mw 
to 10 mw and the frequency of oscillation from 3.5 ge to 5.5 ge. No 
degradation effects have been observed. One particular oscillator has 
been running continuously for two months with a power stability of 
better than +2 per cent, measured with a temperature stabilized power 
meter. 

The author expresses his thanks to R. Neeld, who has developed the 
necessary techniques for assembling and mounting devices with an 
over-all size of several mils. 
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